Overview
Brought to you by YData
Dataset statistics
| Number of variables | 96 |
|---|---|
| Number of observations | 584592 |
| Missing cells | 16055219 |
| Missing cells (%) | 28.6% |
| Total size in memory | 428.2 MiB |
| Average record size in memory | 768.0 B |
Variable types
| Text | 96 |
|---|
Dataset
| Description | Birds NMNH Extant Specimen Records 0054887-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.2en7ue |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "BIRDS" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
basisOfRecord has constant value "PRESERVED_SPECIMEN" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
kingdom has constant value "Animalia" | Constant |
phylum has constant value "Chordata" | Constant |
class has constant value "Aves" | Constant |
datasetKey has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
publishingCountry has constant value "US" | Constant |
kingdomKey has constant value "1" | Constant |
phylumKey has constant value "44" | Constant |
classKey has constant value "212" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2024-12-02T11:48:23.416Z" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
recordNumber has 584474 (> 99.9%) missing values | Missing |
recordedBy has 7123 (1.2%) missing values | Missing |
sex has 112304 (19.2%) missing values | Missing |
lifeStage has 459507 (78.6%) missing values | Missing |
associatedSequences has 580105 (99.2%) missing values | Missing |
occurrenceRemarks has 572414 (97.9%) missing values | Missing |
eventDate has 41361 (7.1%) missing values | Missing |
startDayOfYear has 74069 (12.7%) missing values | Missing |
endDayOfYear has 74069 (12.7%) missing values | Missing |
year has 41376 (7.1%) missing values | Missing |
month has 53877 (9.2%) missing values | Missing |
day has 74434 (12.7%) missing values | Missing |
verbatimEventDate has 235442 (40.3%) missing values | Missing |
habitat has 567355 (97.1%) missing values | Missing |
continent has 27500 (4.7%) missing values | Missing |
waterBody has 558515 (95.5%) missing values | Missing |
stateProvince has 93871 (16.1%) missing values | Missing |
county has 353572 (60.5%) missing values | Missing |
locality has 107551 (18.4%) missing values | Missing |
verbatimElevation has 583323 (99.8%) missing values | Missing |
decimalLatitude has 556566 (95.2%) missing values | Missing |
decimalLongitude has 556566 (95.2%) missing values | Missing |
verbatimCoordinateSystem has 567281 (97.0%) missing values | Missing |
georeferenceProtocol has 583342 (99.8%) missing values | Missing |
identificationQualifier has 583894 (99.9%) missing values | Missing |
typeStatus has 580632 (99.3%) missing values | Missing |
identifiedBy has 581206 (99.4%) missing values | Missing |
specificEpithet has 7917 (1.4%) missing values | Missing |
infraspecificEpithet has 308675 (52.8%) missing values | Missing |
elevation has 498000 (85.2%) missing values | Missing |
elevationAccuracy has 574752 (98.3%) missing values | Missing |
distanceFromCentroidInMeters has 584584 (> 99.9%) missing values | Missing |
mediaType has 26095 (4.5%) missing values | Missing |
speciesKey has 7853 (1.3%) missing values | Missing |
species has 7853 (1.3%) missing values | Missing |
gbifRegion has 19462 (3.3%) missing values | Missing |
level0Gid has 562100 (96.2%) missing values | Missing |
level0Name has 562100 (96.2%) missing values | Missing |
level1Gid has 562129 (96.2%) missing values | Missing |
level1Name has 562129 (96.2%) missing values | Missing |
level2Gid has 562935 (96.3%) missing values | Missing |
level2Name has 563182 (96.3%) missing values | Missing |
level3Gid has 575359 (98.4%) missing values | Missing |
level3Name has 576369 (98.6%) missing values | Missing |
iucnRedListCategory has 273793 (46.8%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 22:54:32.986576 |
|---|---|
| Analysis finished | 2025-01-08 22:54:56.133958 |
| Duration | 23.15 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 584592 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 584592 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 4601228301 |
|---|---|
| 2nd row | 1317203661 |
| 3rd row | 1322538154 |
| 4th row | 1317205864 |
| 5th row | 1317207704 |
| Value | Count | Frequency (%) |
| 4601228301 | 1 | < 0.1% |
| 1322540164 | 1 | < 0.1% |
| 1322550508 | 1 | < 0.1% |
| 1317268099 | 1 | < 0.1% |
| 1317208553 | 1 | < 0.1% |
| 1322538154 | 1 | < 0.1% |
| 1317205864 | 1 | < 0.1% |
| 1317207704 | 1 | < 0.1% |
| 1317208071 | 1 | < 0.1% |
| 1317232225 | 1 | < 0.1% |
| Other values (584582) | 584582 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1203296 | |
| 3 | 890257 | |
| 2 | 755814 | |
| 9 | 472136 | 8.1% |
| 0 | 451322 | 7.7% |
| 8 | 446725 | 7.6% |
| 7 | 431175 | 7.4% |
| 5 | 410207 | 7.0% |
| 4 | 403288 | 6.9% |
| 6 | 381700 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5845920 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1203296 | |
| 3 | 890257 | |
| 2 | 755814 | |
| 9 | 472136 | 8.1% |
| 0 | 451322 | 7.7% |
| 8 | 446725 | 7.6% |
| 7 | 431175 | 7.4% |
| 5 | 410207 | 7.0% |
| 4 | 403288 | 6.9% |
| 6 | 381700 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5845920 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1203296 | |
| 3 | 890257 | |
| 2 | 755814 | |
| 9 | 472136 | 8.1% |
| 0 | 451322 | 7.7% |
| 8 | 446725 | 7.6% |
| 7 | 431175 | 7.4% |
| 5 | 410207 | 7.0% |
| 4 | 403288 | 6.9% |
| 6 | 381700 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5845920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1203296 | |
| 3 | 890257 | |
| 2 | 755814 | |
| 9 | 472136 | 8.1% |
| 0 | 451322 | 7.7% |
| 8 | 446725 | 7.6% |
| 7 | 431175 | 7.4% |
| 5 | 410207 | 7.0% |
| 4 | 403288 | 6.9% |
| 6 | 381700 | 6.5% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1169184 | |
| 0 | 1169184 | |
| _ | 1169184 | |
| 1 | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1753776 | |
| Uppercase Letter | 1169184 | |
| Connector Punctuation | 1169184 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1169184 | |
| 1 | 584592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1169184 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2922960 | |
| Latin | 1169184 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1169184 | |
| _ | 1169184 | |
| 1 | 584592 |
Latin
| Value | Count | Frequency (%) |
| C | 1169184 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4092144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1169184 | |
| 0 | 1169184 | |
| _ | 1169184 | |
| 1 | 584592 |
modified
Text
| Distinct | 11792 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 4737 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 2024-03-26T12:49:00Z |
|---|---|
| 2nd row | 2022-07-12T14:29:00Z |
| 3rd row | 2022-04-29T16:16:00Z |
| 4th row | 2022-04-05T14:20:00Z |
| 5th row | 2022-09-22T21:27:00Z |
| Value | Count | Frequency (%) |
| 2024-09-19t15:58:00z | 8050 | 1.4% |
| 2024-09-19t15:59:00z | 7282 | 1.2% |
| 2024-09-19t15:57:00z | 6771 | 1.2% |
| 2024-11-12t09:38:00z | 6108 | 1.0% |
| 2024-09-19t15:43:00z | 3407 | 0.6% |
| 2024-09-19t16:00:00z | 2927 | 0.5% |
| 2022-09-22t21:42:00z | 2178 | 0.4% |
| 2022-09-22t21:59:00z | 2177 | 0.4% |
| 2022-09-22t20:03:00z | 2168 | 0.4% |
| 2022-09-22t21:51:00z | 2164 | 0.4% |
| Other values (11782) | 541360 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2854206 | |
| 0 | 2776194 | |
| - | 1169184 | |
| : | 1169184 | |
| 1 | 800219 | 6.8% |
| T | 584592 | 5.0% |
| Z | 584592 | 5.0% |
| 9 | 465374 | 4.0% |
| 4 | 411253 | 3.5% |
| 5 | 256891 | 2.2% |
| Other values (4) | 620151 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8184288 | |
| Dash Punctuation | 1169184 | 10.0% |
| Other Punctuation | 1169184 | 10.0% |
| Uppercase Letter | 1169184 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2854206 | |
| 0 | 2776194 | |
| 1 | 800219 | 9.8% |
| 9 | 465374 | 5.7% |
| 4 | 411253 | 5.0% |
| 5 | 256891 | 3.1% |
| 3 | 228479 | 2.8% |
| 7 | 147850 | 1.8% |
| 8 | 125685 | 1.5% |
| 6 | 118137 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169184 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10522656 | |
| Latin | 1169184 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2854206 | |
| 0 | 2776194 | |
| - | 1169184 | |
| : | 1169184 | |
| 1 | 800219 | 7.6% |
| 9 | 465374 | 4.4% |
| 4 | 411253 | 3.9% |
| 5 | 256891 | 2.4% |
| 3 | 228479 | 2.2% |
| 7 | 147850 | 1.4% |
| Other values (2) | 243822 | 2.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11691840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2854206 | |
| 0 | 2776194 | |
| - | 1169184 | |
| : | 1169184 | |
| 1 | 800219 | 6.8% |
| T | 584592 | 5.0% |
| Z | 584592 | 5.0% |
| 9 | 465374 | 4.0% |
| 4 | 411253 | 3.5% |
| 5 | 256891 | 2.2% |
| Other values (4) | 620151 | 5.3% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 584592 | |
| museum | 584592 | |
| of | 584592 | |
| natural | 584592 | |
| history | 584592 | |
| smithsonian | 584592 | |
| institution | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4092144 | |
| i | 3507552 | |
| 3507552 | ||
| a | 2922960 | 8.5% |
| o | 2922960 | 8.5% |
| n | 2922960 | 8.5% |
| s | 2338368 | 6.8% |
| u | 2338368 | 6.8% |
| r | 1169184 | 3.4% |
| m | 1169184 | 3.4% |
| Other values (11) | 7599696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26891232 | |
| Space Separator | 3507552 | 10.2% |
| Uppercase Letter | 3507552 | 10.2% |
| Other Punctuation | 584592 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4092144 | |
| i | 3507552 | |
| a | 2922960 | |
| o | 2922960 | |
| n | 2922960 | |
| s | 2338368 | |
| u | 2338368 | |
| r | 1169184 | 4.3% |
| m | 1169184 | 4.3% |
| l | 1169184 | 4.3% |
| Other values (4) | 2338368 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1169184 | |
| M | 584592 | |
| H | 584592 | |
| S | 584592 | |
| I | 584592 |
Space Separator
| Value | Count | Frequency (%) |
| 3507552 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30398784 | |
| Common | 4092144 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4092144 | |
| i | 3507552 | |
| a | 2922960 | |
| o | 2922960 | |
| n | 2922960 | |
| s | 2338368 | 7.7% |
| u | 2338368 | 7.7% |
| r | 1169184 | 3.8% |
| m | 1169184 | 3.8% |
| N | 1169184 | 3.8% |
| Other values (9) | 5845920 |
Common
| Value | Count | Frequency (%) |
| 3507552 | ||
| , | 584592 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34490928 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4092144 | |
| i | 3507552 | |
| 3507552 | ||
| a | 2922960 | 8.5% |
| o | 2922960 | 8.5% |
| n | 2922960 | 8.5% |
| s | 2338368 | 6.8% |
| u | 2338368 | 6.8% |
| r | 1169184 | 3.4% |
| m | 1169184 | 3.4% |
| Other values (11) | 7599696 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2338368 | |
| : | 2338368 | |
| l | 1753776 | 10.3% |
| i | 1169184 | 6.9% |
| r | 1169184 | 6.9% |
| c | 1169184 | 6.9% |
| g | 584592 | 3.4% |
| 7 | 584592 | 3.4% |
| 8 | 584592 | 3.4% |
| 4 | 584592 | 3.4% |
| Other values (8) | 4676736 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11107248 | |
| Other Punctuation | 2922960 | 17.2% |
| Decimal Number | 2922960 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2338368 | |
| l | 1753776 | |
| i | 1169184 | |
| r | 1169184 | |
| c | 1169184 | |
| g | 584592 | 5.3% |
| u | 584592 | 5.3% |
| b | 584592 | 5.3% |
| d | 584592 | 5.3% |
| s | 584592 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 584592 | |
| 8 | 584592 | |
| 4 | 584592 | |
| 3 | 584592 | |
| 1 | 584592 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2338368 | |
| . | 584592 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11107248 | |
| Common | 5845920 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2338368 | |
| l | 1753776 | |
| i | 1169184 | |
| r | 1169184 | |
| c | 1169184 | |
| g | 584592 | 5.3% |
| u | 584592 | 5.3% |
| b | 584592 | 5.3% |
| d | 584592 | 5.3% |
| s | 584592 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 2338368 | |
| 7 | 584592 | 10.0% |
| 8 | 584592 | 10.0% |
| 4 | 584592 | 10.0% |
| 3 | 584592 | 10.0% |
| . | 584592 | 10.0% |
| 1 | 584592 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16953168 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2338368 | |
| : | 2338368 | |
| l | 1753776 | 10.3% |
| i | 1169184 | 6.9% |
| r | 1169184 | 6.9% |
| c | 1169184 | 6.9% |
| g | 584592 | 3.4% |
| 7 | 584592 | 3.4% |
| 8 | 584592 | 3.4% |
| 4 | 584592 | 3.4% |
| Other values (8) | 4676736 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 |
|---|---|
| 2nd row | urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 |
| 3rd row | urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 |
| 4th row | urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 |
| 5th row | urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 |
| Value | Count | Frequency (%) |
| urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 3507552 | |
| d | 2922960 | |
| 9 | 2338368 | 8.9% |
| - | 2338368 | 8.9% |
| u | 1753776 | 6.7% |
| 8 | 1753776 | 6.7% |
| 2 | 1753776 | 6.7% |
| 7 | 1169184 | 4.4% |
| : | 1169184 | 4.4% |
| c | 1169184 | 4.4% |
| Other values (10) | 6430512 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12861024 | |
| Lowercase Letter | 9938064 | |
| Dash Punctuation | 2338368 | 8.9% |
| Other Punctuation | 1169184 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3507552 | |
| 9 | 2338368 | |
| 8 | 1753776 | |
| 2 | 1753776 | |
| 7 | 1169184 | 9.1% |
| 1 | 584592 | 4.5% |
| 4 | 584592 | 4.5% |
| 0 | 584592 | 4.5% |
| 6 | 584592 | 4.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2922960 | |
| u | 1753776 | |
| c | 1169184 | 11.8% |
| a | 1169184 | 11.8% |
| i | 584592 | 5.9% |
| e | 584592 | 5.9% |
| r | 584592 | 5.9% |
| n | 584592 | 5.9% |
| b | 584592 | 5.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2338368 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16368576 | |
| Latin | 9938064 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 3507552 | |
| 9 | 2338368 | |
| - | 2338368 | |
| 8 | 1753776 | |
| 2 | 1753776 | |
| 7 | 1169184 | 7.1% |
| : | 1169184 | 7.1% |
| 1 | 584592 | 3.6% |
| 4 | 584592 | 3.6% |
| 0 | 584592 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| d | 2922960 | |
| u | 1753776 | |
| c | 1169184 | 11.8% |
| a | 1169184 | 11.8% |
| i | 584592 | 5.9% |
| e | 584592 | 5.9% |
| r | 584592 | 5.9% |
| n | 584592 | 5.9% |
| b | 584592 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26306640 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 3507552 | |
| d | 2922960 | |
| 9 | 2338368 | 8.9% |
| - | 2338368 | 8.9% |
| u | 1753776 | 6.7% |
| 8 | 1753776 | 6.7% |
| 2 | 1753776 | 6.7% |
| 7 | 1169184 | 4.4% |
| : | 1169184 | 4.4% |
| c | 1169184 | 4.4% |
| Other values (10) | 6430512 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2338368 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2338368 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2338368 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BIRDS |
|---|---|
| 2nd row | BIRDS |
| 3rd row | BIRDS |
| 4th row | BIRDS |
| 5th row | BIRDS |
| Value | Count | Frequency (%) |
| birds | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 584592 | |
| I | 584592 | |
| R | 584592 | |
| D | 584592 | |
| S | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2922960 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 584592 | |
| I | 584592 | |
| R | 584592 | |
| D | 584592 | |
| S | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2922960 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 584592 | |
| I | 584592 | |
| R | 584592 | |
| D | 584592 | |
| S | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2922960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 584592 | |
| I | 584592 | |
| R | 584592 | |
| D | 584592 | |
| S | 584592 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 584592 | |
| extant | 584592 | |
| biology | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1169184 | 10.5% |
| 1169184 | 10.5% | |
| t | 1169184 | 10.5% |
| o | 1169184 | 10.5% |
| M | 584592 | 5.3% |
| H | 584592 | 5.3% |
| E | 584592 | 5.3% |
| x | 584592 | 5.3% |
| a | 584592 | 5.3% |
| n | 584592 | 5.3% |
| Other values (5) | 2922960 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6430512 | |
| Uppercase Letter | 3507552 | |
| Space Separator | 1169184 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1169184 | |
| o | 1169184 | |
| x | 584592 | |
| a | 584592 | |
| n | 584592 | |
| i | 584592 | |
| l | 584592 | |
| g | 584592 | |
| y | 584592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1169184 | |
| M | 584592 | |
| H | 584592 | |
| E | 584592 | |
| B | 584592 |
Space Separator
| Value | Count | Frequency (%) |
| 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9938064 | |
| Common | 1169184 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1169184 | |
| t | 1169184 | |
| o | 1169184 | |
| M | 584592 | 5.9% |
| H | 584592 | 5.9% |
| E | 584592 | 5.9% |
| x | 584592 | 5.9% |
| a | 584592 | 5.9% |
| n | 584592 | 5.9% |
| B | 584592 | 5.9% |
| Other values (4) | 2338368 |
Common
| Value | Count | Frequency (%) |
| 1169184 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11107248 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1169184 | 10.5% |
| 1169184 | 10.5% | |
| t | 1169184 | 10.5% |
| o | 1169184 | 10.5% |
| M | 584592 | 5.3% |
| H | 584592 | 5.3% |
| E | 584592 | 5.3% |
| x | 584592 | 5.3% |
| a | 584592 | 5.3% |
| n | 584592 | 5.3% |
| Other values (5) | 2922960 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2922960 | |
| P | 1169184 | 11.1% |
| R | 1169184 | 11.1% |
| S | 1169184 | 11.1% |
| V | 584592 | 5.6% |
| D | 584592 | 5.6% |
| _ | 584592 | 5.6% |
| C | 584592 | 5.6% |
| I | 584592 | 5.6% |
| M | 584592 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9938064 | |
| Connector Punctuation | 584592 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2922960 | |
| P | 1169184 | 11.8% |
| R | 1169184 | 11.8% |
| S | 1169184 | 11.8% |
| V | 584592 | 5.9% |
| D | 584592 | 5.9% |
| C | 584592 | 5.9% |
| I | 584592 | 5.9% |
| M | 584592 | 5.9% |
| N | 584592 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9938064 | |
| Common | 584592 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2922960 | |
| P | 1169184 | 11.8% |
| R | 1169184 | 11.8% |
| S | 1169184 | 11.8% |
| V | 584592 | 5.9% |
| D | 584592 | 5.9% |
| C | 584592 | 5.9% |
| I | 584592 | 5.9% |
| M | 584592 | 5.9% |
| N | 584592 | 5.9% |
Common
| Value | Count | Frequency (%) |
| _ | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10522656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2922960 | |
| P | 1169184 | 11.1% |
| R | 1169184 | 11.1% |
| S | 1169184 | 11.1% |
| V | 584592 | 5.6% |
| D | 584592 | 5.6% |
| _ | 584592 | 5.6% |
| C | 584592 | 5.6% |
| I | 584592 | 5.6% |
| M | 584592 | 5.6% |
occurrenceID
Text
Unique 
| Distinct | 584592 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 584592 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/300075fa7-edd1-461a-9f08-e6ba501db28c |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3000df15d-8cee-4e97-92ce-bb2a2eabd590 |
| 3rd row | http://n2t.net/ark:/65665/3ec08151f-42be-49b5-868b-d3deeddbd447 |
| 4th row | http://n2t.net/ark:/65665/30026d668-b659-45a3-8494-25f389913e98 |
| 5th row | http://n2t.net/ark:/65665/3003b6dd3-df37-400f-8ae6-e515ea9c2d04 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/300075fa7-edd1-461a-9f08-e6ba501db28c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec1dbc05-3709-4356-a820-34fb80d5a314 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec937490-e545-4db6-812d-bbcfe6057996 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/302e7b9b3-e03c-4d08-a4a5-3110143884c6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3004420cd-5dd8-4d0b-bb81-5df504988ccf | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec08151f-42be-49b5-868b-d3deeddbd447 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30026d668-b659-45a3-8494-25f389913e98 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3003b6dd3-df37-400f-8ae6-e515ea9c2d04 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3003f1ccb-ef9c-4862-9369-5c82ac27e83e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30150f58d-26d0-475b-b905-a9bb8e072667 | 1 | < 0.1% |
| Other values (584582) | 584582 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 2922960 | 7.9% |
| 6 | 2852045 | 7.7% |
| - | 2338368 | 6.3% |
| t | 2338368 | 6.3% |
| 5 | 2265649 | 6.2% |
| a | 1827322 | 5.0% |
| 2 | 1681511 | 4.6% |
| 3 | 1680550 | 4.6% |
| e | 1680227 | 4.6% |
| 4 | 1679992 | 4.6% |
| Other values (16) | 15562304 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15931516 | |
| Lowercase Letter | 13882676 | |
| Other Punctuation | 4676736 | 12.7% |
| Dash Punctuation | 2338368 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2338368 | |
| a | 1827322 | |
| e | 1680227 | |
| b | 1241614 | |
| n | 1169184 | |
| c | 1097145 | |
| f | 1096769 | |
| d | 1093679 | |
| k | 584592 | 4.2% |
| r | 584592 | 4.2% |
| Other values (2) | 1169184 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2852045 | |
| 5 | 2265649 | |
| 2 | 1681511 | |
| 3 | 1680550 | |
| 4 | 1679992 | |
| 8 | 1243182 | |
| 9 | 1240698 | |
| 1 | 1096282 | 6.9% |
| 7 | 1096019 | 6.9% |
| 0 | 1095588 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2922960 | |
| : | 1169184 | 25.0% |
| . | 584592 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2338368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22946620 | |
| Latin | 13882676 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 2922960 | |
| 6 | 2852045 | |
| - | 2338368 | |
| 5 | 2265649 | |
| 2 | 1681511 | |
| 3 | 1680550 | |
| 4 | 1679992 | |
| 8 | 1243182 | 5.4% |
| 9 | 1240698 | 5.4% |
| : | 1169184 | 5.1% |
| Other values (4) | 3872481 |
Latin
| Value | Count | Frequency (%) |
| t | 2338368 | |
| a | 1827322 | |
| e | 1680227 | |
| b | 1241614 | |
| n | 1169184 | |
| c | 1097145 | |
| f | 1096769 | |
| d | 1093679 | |
| k | 584592 | 4.2% |
| r | 584592 | 4.2% |
| Other values (2) | 1169184 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36829296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 2922960 | 7.9% |
| 6 | 2852045 | 7.7% |
| - | 2338368 | 6.3% |
| t | 2338368 | 6.3% |
| 5 | 2265649 | 6.2% |
| a | 1827322 | 5.0% |
| 2 | 1681511 | 4.6% |
| 3 | 1680550 | 4.6% |
| e | 1680227 | 4.6% |
| 4 | 1679992 | 4.6% |
| Other values (16) | 15562304 |
catalogNumber
Text
Unique 
| Distinct | 584592 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.92067972 |
| Min length | 6 |
Unique
| Unique | 584592 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | USNM A16396 |
|---|---|
| 2nd row | USNM 101402 |
| 3rd row | USNM B28085 |
| 4th row | USNM 289875 |
| 5th row | USNM 562118 |
| Value | Count | Frequency (%) |
| usnm | 584592 | |
| 438818 | 1 | < 0.1% |
| 160226 | 1 | < 0.1% |
| 540920 | 1 | < 0.1% |
| 400497 | 1 | < 0.1% |
| b28085 | 1 | < 0.1% |
| 289875 | 1 | < 0.1% |
| 562118 | 1 | < 0.1% |
| b42715 | 1 | < 0.1% |
| 378552 | 1 | < 0.1% |
| Other values (584583) | 584583 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584592 | 9.2% |
| S | 584592 | 9.2% |
| N | 584592 | 9.2% |
| M | 584592 | 9.2% |
| 584592 | 9.2% | |
| 3 | 396623 | 6.2% |
| 4 | 396155 | 6.2% |
| 5 | 388165 | 6.1% |
| 1 | 387443 | 6.1% |
| 2 | 382727 | 6.0% |
| Other values (7) | 1510069 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3420292 | |
| Uppercase Letter | 2379258 | |
| Space Separator | 584592 | 9.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 396623 | |
| 4 | 396155 | |
| 5 | 388165 | |
| 1 | 387443 | |
| 2 | 382727 | |
| 6 | 326899 | |
| 0 | 287859 | |
| 9 | 286088 | |
| 8 | 284189 | |
| 7 | 284144 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 | |
| B | 34602 | 1.5% |
| A | 6288 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4004884 | |
| Latin | 2379258 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 584592 | ||
| 3 | 396623 | |
| 4 | 396155 | |
| 5 | 388165 | |
| 1 | 387443 | |
| 2 | 382727 | |
| 6 | 326899 | |
| 0 | 287859 | |
| 9 | 286088 | |
| 8 | 284189 |
Latin
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 | |
| N | 584592 | |
| M | 584592 | |
| B | 34602 | 1.5% |
| A | 6288 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6384142 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584592 | 9.2% |
| S | 584592 | 9.2% |
| N | 584592 | 9.2% |
| M | 584592 | 9.2% |
| 584592 | 9.2% | |
| 3 | 396623 | 6.2% |
| 4 | 396155 | 6.2% |
| 5 | 388165 | 6.1% |
| 1 | 387443 | 6.1% |
| 2 | 382727 | 6.0% |
| Other values (7) | 1510069 |
recordNumber
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 584474 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.059322034 |
| Min length | 1 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 2.5% |
Sample
| 1st row | l |
|---|---|
| 2nd row | l |
| 3rd row | du |
| 4th row | l |
| 5th row | l |
| Value | Count | Frequency (%) |
| l | 115 | |
| du | 1 | 0.8% |
| riley | 1 | 0.8% |
| sta | 1 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 116 | |
| d | 1 | 0.8% |
| u | 1 | 0.8% |
| r | 1 | 0.8% |
| i | 1 | 0.8% |
| e | 1 | 0.8% |
| y | 1 | 0.8% |
| s | 1 | 0.8% |
| t | 1 | 0.8% |
| a | 1 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 116 | |
| d | 1 | 0.8% |
| u | 1 | 0.8% |
| r | 1 | 0.8% |
| i | 1 | 0.8% |
| e | 1 | 0.8% |
| y | 1 | 0.8% |
| s | 1 | 0.8% |
| t | 1 | 0.8% |
| a | 1 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 125 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 116 | |
| d | 1 | 0.8% |
| u | 1 | 0.8% |
| r | 1 | 0.8% |
| i | 1 | 0.8% |
| e | 1 | 0.8% |
| y | 1 | 0.8% |
| s | 1 | 0.8% |
| t | 1 | 0.8% |
| a | 1 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 116 | |
| d | 1 | 0.8% |
| u | 1 | 0.8% |
| r | 1 | 0.8% |
| i | 1 | 0.8% |
| e | 1 | 0.8% |
| y | 1 | 0.8% |
| s | 1 | 0.8% |
| t | 1 | 0.8% |
| a | 1 | 0.8% |
recordedBy
Text
Missing 
| Distinct | 13250 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 7123 |
| Missing (%) | 1.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 55 |
| Mean length | 11.76426613 |
| Min length | 1 |
Unique
| Unique | 6170 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | T. Page |
|---|---|
| 2nd row | C. Worthen |
| 3rd row | H. Lee |
| 4th row | C. Sperry |
| 5th row | C. Ross |
| Value | Count | Frequency (%) |
| a | 64567 | 4.8% |
| j | 60293 | 4.5% |
| e | 58464 | 4.4% |
| 56508 | 4.2% | |
| w | 52970 | 4.0% |
| h | 41937 | 3.1% |
| m | 37812 | 2.8% |
| c | 37330 | 2.8% |
| t | 32505 | 2.4% |
| wetmore | 32367 | 2.4% |
| Other values (7402) | 863863 |
Most occurring characters
| Value | Count | Frequency (%) |
| 761147 | 11.2% | |
| . | 558992 | 8.2% |
| e | 547336 | 8.1% |
| r | 485535 | 7.1% |
| o | 389498 | 5.7% |
| n | 353948 | 5.2% |
| a | 303496 | 4.5% |
| l | 299899 | 4.4% |
| i | 264364 | 3.9% |
| t | 245352 | 3.6% |
| Other values (55) | 2583932 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4115057 | |
| Uppercase Letter | 1287488 | 19.0% |
| Space Separator | 761147 | 11.2% |
| Other Punctuation | 622658 | 9.2% |
| Dash Punctuation | 3521 | 0.1% |
| Decimal Number | 2824 | < 0.1% |
| Open Punctuation | 402 | < 0.1% |
| Close Punctuation | 402 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 547336 | |
| r | 485535 | |
| o | 389498 | |
| n | 353948 | 8.6% |
| a | 303496 | 7.4% |
| l | 299899 | 7.3% |
| i | 264364 | 6.4% |
| t | 245352 | 6.0% |
| s | 161797 | 3.9% |
| c | 132938 | 3.2% |
| Other values (16) | 930894 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 117570 | 9.1% |
| C | 117514 | 9.1% |
| B | 99641 | 7.7% |
| A | 96140 | 7.5% |
| M | 90141 | 7.0% |
| H | 82765 | 6.4% |
| R | 77734 | 6.0% |
| P | 76833 | 6.0% |
| J | 69213 | 5.4% |
| S | 67116 | 5.2% |
| Other values (16) | 392821 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 558992 | |
| & | 56427 | 9.1% |
| , | 6619 | 1.1% |
| ' | 606 | 0.1% |
| ? | 13 | < 0.1% |
| / | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1412 | |
| 1 | 708 | |
| 8 | 704 |
Space Separator
| Value | Count | Frequency (%) |
| 761147 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3521 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 402 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5402545 | |
| Common | 1390954 | 20.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 547336 | 10.1% |
| r | 485535 | 9.0% |
| o | 389498 | 7.2% |
| n | 353948 | 6.6% |
| a | 303496 | 5.6% |
| l | 299899 | 5.6% |
| i | 264364 | 4.9% |
| t | 245352 | 4.5% |
| s | 161797 | 3.0% |
| c | 132938 | 2.5% |
| Other values (42) | 2218382 |
Common
| Value | Count | Frequency (%) |
| 761147 | ||
| . | 558992 | |
| & | 56427 | 4.1% |
| , | 6619 | 0.5% |
| - | 3521 | 0.3% |
| 9 | 1412 | 0.1% |
| 1 | 708 | 0.1% |
| 8 | 704 | 0.1% |
| ' | 606 | < 0.1% |
| ( | 402 | < 0.1% |
| Other values (3) | 416 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6793499 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 761147 | 11.2% | |
| . | 558992 | 8.2% |
| e | 547336 | 8.1% |
| r | 485535 | 7.1% |
| o | 389498 | 5.7% |
| n | 353948 | 5.2% |
| a | 303496 | 4.5% |
| l | 299899 | 4.4% |
| i | 264364 | 3.9% |
| t | 245352 | 3.6% |
| Other values (55) | 2583932 |
individualCount
Text
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.001168336 |
| Min length | 1 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 4 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 558309 | |
| 2 | 6799 | 1.2% |
| 4 | 6794 | 1.2% |
| 3 | 6435 | 1.1% |
| 5 | 3136 | 0.5% |
| 6 | 1204 | 0.2% |
| 7 | 608 | 0.1% |
| 8 | 374 | 0.1% |
| 9 | 251 | < 0.1% |
| 10 | 169 | < 0.1% |
| Other values (39) | 513 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 559052 | |
| 2 | 6944 | 1.2% |
| 4 | 6855 | 1.2% |
| 3 | 6513 | 1.1% |
| 5 | 3191 | 0.5% |
| 6 | 1239 | 0.2% |
| 7 | 631 | 0.1% |
| 8 | 397 | 0.1% |
| 9 | 270 | < 0.1% |
| 0 | 183 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 585275 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 559052 | |
| 2 | 6944 | 1.2% |
| 4 | 6855 | 1.2% |
| 3 | 6513 | 1.1% |
| 5 | 3191 | 0.5% |
| 6 | 1239 | 0.2% |
| 7 | 631 | 0.1% |
| 8 | 397 | 0.1% |
| 9 | 270 | < 0.1% |
| 0 | 183 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 585275 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 559052 | |
| 2 | 6944 | 1.2% |
| 4 | 6855 | 1.2% |
| 3 | 6513 | 1.1% |
| 5 | 3191 | 0.5% |
| 6 | 1239 | 0.2% |
| 7 | 631 | 0.1% |
| 8 | 397 | 0.1% |
| 9 | 270 | < 0.1% |
| 0 | 183 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 585275 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 559052 | |
| 2 | 6944 | 1.2% |
| 4 | 6855 | 1.2% |
| 3 | 6513 | 1.1% |
| 5 | 3191 | 0.5% |
| 6 | 1239 | 0.2% |
| 7 | 631 | 0.1% |
| 8 | 397 | 0.1% |
| 9 | 270 | < 0.1% |
| 0 | 183 | < 0.1% |
sex
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 112304 |
| Missing (%) | 19.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.817674809 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | MALE |
| Value | Count | Frequency (%) |
| male | 279199 | |
| female | 193089 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 665377 | |
| M | 472288 | |
| A | 472288 | |
| L | 472288 | |
| F | 193089 | 8.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2275330 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 665377 | |
| M | 472288 | |
| A | 472288 | |
| L | 472288 | |
| F | 193089 | 8.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2275330 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 665377 | |
| M | 472288 | |
| A | 472288 | |
| L | 472288 | |
| F | 193089 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2275330 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 665377 | |
| M | 472288 | |
| A | 472288 | |
| L | 472288 | |
| F | 193089 | 8.5% |
lifeStage
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 459507 |
| Missing (%) | 78.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 5.961034497 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Immature |
|---|---|
| 2nd row | Juvenile |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 81111 | |
| immature | 27828 | 22.2% |
| juvenile | 10762 | 8.6% |
| chick | 3709 | 3.0% |
| subadult | 1382 | 1.1% |
| embryo | 292 | 0.2% |
| nestling | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 122465 | |
| t | 110322 | |
| l | 93256 | |
| d | 82493 | |
| A | 81111 | |
| m | 55948 | |
| e | 49353 | |
| a | 29210 | 3.9% |
| r | 28120 | 3.8% |
| I | 27828 | 3.7% |
| Other values (16) | 65530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 620551 | |
| Uppercase Letter | 125085 | 16.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 122465 | |
| t | 110322 | |
| l | 93256 | |
| d | 82493 | |
| m | 55948 | |
| e | 49353 | |
| a | 29210 | 4.7% |
| r | 28120 | 4.5% |
| i | 14472 | 2.3% |
| n | 10763 | 1.7% |
| Other values (9) | 24149 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 81111 | |
| I | 27828 | 22.2% |
| J | 10762 | 8.6% |
| C | 3709 | 3.0% |
| S | 1382 | 1.1% |
| E | 292 | 0.2% |
| N | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 745636 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 122465 | |
| t | 110322 | |
| l | 93256 | |
| d | 82493 | |
| A | 81111 | |
| m | 55948 | |
| e | 49353 | |
| a | 29210 | 3.9% |
| r | 28120 | 3.8% |
| I | 27828 | 3.7% |
| Other values (16) | 65530 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 745636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 122465 | |
| t | 110322 | |
| l | 93256 | |
| d | 82493 | |
| A | 81111 | |
| m | 55948 | |
| e | 49353 | |
| a | 29210 | 3.9% |
| r | 28120 | 3.8% |
| I | 27828 | 3.7% |
| Other values (16) | 65530 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1169184 | |
| P | 584592 | |
| R | 584592 | |
| S | 584592 | |
| N | 584592 | |
| T | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4092144 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1169184 | |
| P | 584592 | |
| R | 584592 | |
| S | 584592 | |
| N | 584592 | |
| T | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4092144 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1169184 | |
| P | 584592 | |
| R | 584592 | |
| S | 584592 | |
| N | 584592 | |
| T | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4092144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1169184 | |
| P | 584592 | |
| R | 584592 | |
| S | 584592 | |
| N | 584592 | |
| T | 584592 |
preparations
Text
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 11 |
| Mean length | 11.71096126 |
| Min length | 6 |
Unique
| Unique | 39 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Skin: Whole |
|---|---|
| 2nd row | Skin: Whole |
| 3rd row | Egg(s) |
| 4th row | Skeleton: Whole |
| 5th row | Skeleton: Whole |
| Value | Count | Frequency (%) |
| whole | 535339 | |
| skin | 470355 | |
| skeleton | 58626 | 5.0% |
| egg(s | 33064 | 2.8% |
| fluid | 32579 | 2.8% |
| partial | 24616 | 2.1% |
| nest(s | 4794 | 0.4% |
| feather(s | 4784 | 0.4% |
| mounted | 1952 | 0.2% |
| clutch | 967 | 0.1% |
| Other values (7) | 2530 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 671417 | |
| l | 654016 | |
| o | 595917 | |
| 585020 | ||
| : | 562892 | |
| h | 541090 | |
| W | 535338 | |
| n | 532123 | |
| i | 529352 | |
| S | 529335 | |
| Other values (21) | 1109564 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4423169 | |
| Uppercase Letter | 1169248 | 17.1% |
| Space Separator | 585020 | 8.5% |
| Other Punctuation | 583343 | 8.5% |
| Open Punctuation | 42642 | 0.6% |
| Close Punctuation | 42642 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 671417 | |
| l | 654016 | |
| o | 595917 | |
| h | 541090 | |
| n | 532123 | |
| i | 529352 | |
| k | 528981 | |
| t | 96879 | 2.2% |
| g | 66128 | 1.5% |
| a | 55099 | 1.2% |
| Other values (8) | 152167 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 535338 | |
| S | 529335 | |
| F | 37363 | 3.2% |
| E | 33064 | 2.8% |
| P | 24615 | 2.1% |
| N | 4794 | 0.4% |
| M | 3399 | 0.3% |
| C | 1340 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 562892 | |
| ; | 20451 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| 585020 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 42642 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 42642 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5592417 | |
| Common | 1253647 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 671417 | |
| l | 654016 | |
| o | 595917 | |
| h | 541090 | |
| W | 535338 | |
| n | 532123 | |
| i | 529352 | |
| S | 529335 | |
| k | 528981 | |
| t | 96879 | 1.7% |
| Other values (16) | 377969 |
Common
| Value | Count | Frequency (%) |
| 585020 | ||
| : | 562892 | |
| ( | 42642 | 3.4% |
| ) | 42642 | 3.4% |
| ; | 20451 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6846064 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 671417 | |
| l | 654016 | |
| o | 595917 | |
| 585020 | ||
| : | 562892 | |
| h | 541090 | |
| W | 535338 | |
| n | 532123 | |
| i | 529352 | |
| S | 529335 | |
| Other values (21) | 1109564 |
Missing 
| Distinct | 4430 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 580105 |
| Missing (%) | 99.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 12558 |
|---|---|
| Median length | 49 |
| Mean length | 129.0780031 |
| Min length | 49 |
Unique
| Unique | 4421 ? |
|---|---|
| Unique (%) | 98.5% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=KM080095 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ176229 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=JQ173910 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=KU722483 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ547617;https://www.ncbi.nlm.nih.gov/gquery?term=FJ547732;https://www.ncbi.nlm.nih.gov/gquery?term=FJ547781;https://www.ncbi.nlm.nih.gov/gquery?term=FJ547782 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=prjna521985 | 34 | 0.8% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273835 | 10 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273864 | 8 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay273832 | 3 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj207364 | 3 | 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj207374 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mh778417 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=dq433197 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj207379 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mt456681 | 1 | < 0.1% |
| Other values (4420) | 4420 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 46361 | 8.0% |
| / | 34770 | 6.0% |
| w | 34770 | 6.0% |
| n | 34770 | 6.0% |
| t | 34770 | 6.0% |
| h | 23180 | 4.0% |
| r | 23180 | 4.0% |
| e | 23180 | 4.0% |
| i | 23180 | 4.0% |
| m | 23180 | 4.0% |
| Other values (53) | 277832 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 359290 | |
| Other Punctuation | 111414 | 19.2% |
| Decimal Number | 71114 | 12.3% |
| Uppercase Letter | 25344 | 4.4% |
| Math Symbol | 11590 | 2.0% |
| Dash Punctuation | 420 | 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 4122 | |
| J | 3684 | |
| Q | 3175 | |
| U | 2477 | |
| E | 1468 | 5.8% |
| R | 1383 | 5.5% |
| M | 1361 | 5.4% |
| F | 1128 | 4.5% |
| N | 849 | 3.3% |
| S | 753 | 3.0% |
| Other values (16) | 4944 |
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 34770 | 9.7% |
| n | 34770 | 9.7% |
| t | 34770 | 9.7% |
| h | 23180 | 6.5% |
| r | 23180 | 6.5% |
| e | 23180 | 6.5% |
| i | 23180 | 6.5% |
| m | 23180 | 6.5% |
| g | 23180 | 6.5% |
| q | 11590 | 3.2% |
| Other values (9) | 104310 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 9784 | |
| 1 | 8422 | |
| 2 | 7230 | |
| 5 | 7006 | |
| 4 | 6944 | |
| 9 | 6757 | |
| 0 | 6595 | |
| 3 | 6298 | |
| 8 | 6093 | |
| 6 | 5985 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46361 | |
| / | 34770 | |
| ? | 11590 | 10.4% |
| : | 11590 | 10.4% |
| ; | 7103 | 6.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 11590 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 420 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 384634 | |
| Common | 194539 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 34770 | 9.0% |
| n | 34770 | 9.0% |
| t | 34770 | 9.0% |
| h | 23180 | 6.0% |
| r | 23180 | 6.0% |
| e | 23180 | 6.0% |
| i | 23180 | 6.0% |
| m | 23180 | 6.0% |
| g | 23180 | 6.0% |
| q | 11590 | 3.0% |
| Other values (35) | 129654 |
Common
| Value | Count | Frequency (%) |
| . | 46361 | |
| / | 34770 | |
| = | 11590 | 6.0% |
| ? | 11590 | 6.0% |
| : | 11590 | 6.0% |
| 7 | 9784 | 5.0% |
| 1 | 8422 | 4.3% |
| 2 | 7230 | 3.7% |
| ; | 7103 | 3.7% |
| 5 | 7006 | 3.6% |
| Other values (8) | 39093 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 579173 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 46361 | 8.0% |
| / | 34770 | 6.0% |
| w | 34770 | 6.0% |
| n | 34770 | 6.0% |
| t | 34770 | 6.0% |
| h | 23180 | 4.0% |
| r | 23180 | 4.0% |
| e | 23180 | 4.0% |
| i | 23180 | 4.0% |
| m | 23180 | 4.0% |
| Other values (53) | 277832 |
Missing 
| Distinct | 7341 |
|---|---|
| Distinct (%) | 60.3% |
| Missing | 572414 |
| Missing (%) | 97.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 6354 |
|---|---|
| Median length | 555 |
| Mean length | 50.68484152 |
| Min length | 1 |
Unique
| Unique | 6370 ? |
|---|---|
| Unique (%) | 52.3% |
Sample
| 1st row | carcass saved |
|---|---|
| 2nd row | fertile |
| 3rd row | A second soft part color is listed, but it is in French. It needs translated; the handwriting is somewhat smushed and hard to read. Appears to be "Patte et tour des yeux carminis." [Feet and eye ring carmine?] |
| 4th row | breeding |
| 5th row | W.P. Taylor |
| Value | Count | Frequency (%) |
| of | 4593 | 4.4% |
| in | 2349 | 2.2% |
| as | 2209 | 2.1% |
| the | 2118 | 2.0% |
| usnm | 2055 | 2.0% |
| tag | 1748 | 1.7% |
| specimens | 1534 | 1.5% |
| cataloged | 1516 | 1.4% |
| 1422 | 1.4% | |
| originally | 1393 | 1.3% |
| Other values (10725) | 84151 |
Most occurring characters
| Value | Count | Frequency (%) |
| 92912 | ||
| e | 51486 | 8.3% |
| a | 37742 | 6.1% |
| n | 34944 | 5.7% |
| o | 33997 | 5.5% |
| i | 32379 | 5.2% |
| t | 32167 | 5.2% |
| s | 26495 | 4.3% |
| r | 25801 | 4.2% |
| l | 22827 | 3.7% |
| Other values (92) | 226490 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 429839 | |
| Space Separator | 92912 | 15.1% |
| Uppercase Letter | 38725 | 6.3% |
| Decimal Number | 34189 | 5.5% |
| Other Punctuation | 18120 | 2.9% |
| Dash Punctuation | 1707 | 0.3% |
| Open Punctuation | 745 | 0.1% |
| Close Punctuation | 743 | 0.1% |
| Math Symbol | 219 | < 0.1% |
| Final Punctuation | 12 | < 0.1% |
| Other values (5) | 29 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 51486 | |
| a | 37742 | 8.8% |
| n | 34944 | 8.1% |
| o | 33997 | 7.9% |
| i | 32379 | 7.5% |
| t | 32167 | 7.5% |
| s | 26495 | 6.2% |
| r | 25801 | 6.0% |
| l | 22827 | 5.3% |
| d | 18987 | 4.4% |
| Other values (20) | 113014 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4637 | |
| N | 4058 | 10.5% |
| M | 3832 | 9.9% |
| U | 3571 | 9.2% |
| C | 3039 | 7.8% |
| O | 2136 | 5.5% |
| A | 1965 | 5.1% |
| T | 1912 | 4.9% |
| B | 1512 | 3.9% |
| F | 1474 | 3.8% |
| Other values (16) | 10589 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7077 | |
| , | 3513 | |
| : | 1896 | 10.5% |
| ; | 1812 | 10.0% |
| " | 1381 | 7.6% |
| # | 887 | 4.9% |
| / | 499 | 2.8% |
| ' | 346 | 1.9% |
| & | 321 | 1.8% |
| ? | 158 | 0.9% |
| Other values (5) | 230 | 1.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5919 | |
| 2 | 4779 | |
| 0 | 3893 | |
| 5 | 3430 | |
| 3 | 3030 | |
| 6 | 3028 | |
| 4 | 2960 | |
| 9 | 2714 | |
| 8 | 2382 | |
| 7 | 2054 | 6.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 92 | |
| = | 68 | |
| > | 34 | 15.5% |
| < | 14 | 6.4% |
| ± | 9 | 4.1% |
| ~ | 2 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1699 | |
| – | 6 | 0.4% |
| — | 1 | 0.1% |
| ‒ | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 633 | |
| [ | 112 | 15.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 632 | |
| ] | 111 | 14.9% |
Space Separator
| Value | Count | Frequency (%) |
| 92912 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 12 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 12 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 11 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 468565 | |
| Common | 148675 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 51486 | 11.0% |
| a | 37742 | 8.1% |
| n | 34944 | 7.5% |
| o | 33997 | 7.3% |
| i | 32379 | 6.9% |
| t | 32167 | 6.9% |
| s | 26495 | 5.7% |
| r | 25801 | 5.5% |
| l | 22827 | 4.9% |
| d | 18987 | 4.1% |
| Other values (47) | 151740 |
Common
| Value | Count | Frequency (%) |
| 92912 | ||
| . | 7077 | 4.8% |
| 1 | 5919 | 4.0% |
| 2 | 4779 | 3.2% |
| 0 | 3893 | 2.6% |
| , | 3513 | 2.4% |
| 5 | 3430 | 2.3% |
| 3 | 3030 | 2.0% |
| 6 | 3028 | 2.0% |
| 4 | 2960 | 2.0% |
| Other values (35) | 18134 | 12.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 617187 | |
| Punctuation | 32 | < 0.1% |
| None | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 92912 | ||
| e | 51486 | 8.3% |
| a | 37742 | 6.1% |
| n | 34944 | 5.7% |
| o | 33997 | 5.5% |
| i | 32379 | 5.2% |
| t | 32167 | 5.2% |
| s | 26495 | 4.3% |
| r | 25801 | 4.2% |
| l | 22827 | 3.7% |
| Other values (80) | 226437 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 12 | |
| “ | 12 | |
| – | 6 | |
| — | 1 | 3.1% |
| ‒ | 1 | 3.1% |
None
| Value | Count | Frequency (%) |
| ± | 9 | |
| é | 3 | 14.3% |
| ó | 2 | 9.5% |
| ñ | 2 | 9.5% |
| ç | 2 | 9.5% |
| ° | 2 | 9.5% |
| º | 1 | 4.8% |
eventDate
Text
Missing 
| Distinct | 51161 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 41361 |
| Missing (%) | 7.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.758292513 |
| Min length | 4 |
Unique
| Unique | 7962 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | 1859-05 |
|---|---|
| 2nd row | 1883-03-18 |
| 3rd row | 1895-05-26 |
| 4th row | 1924-08-06 |
| 5th row | 1987-04-09 |
| Value | Count | Frequency (%) |
| 1865 | 620 | 0.1% |
| 1877 | 533 | 0.1% |
| 1966 | 478 | 0.1% |
| 1926 | 419 | 0.1% |
| 1939-07 | 366 | 0.1% |
| 1937 | 360 | 0.1% |
| 1936 | 281 | 0.1% |
| 1884 | 276 | 0.1% |
| 1888 | 253 | < 0.1% |
| 1881 | 250 | < 0.1% |
| Other values (51151) | 539395 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1042086 | |
| 1 | 1031643 | |
| 0 | 807962 | |
| 9 | 611293 | |
| 2 | 400793 | 7.6% |
| 8 | 308847 | 5.8% |
| 6 | 249386 | 4.7% |
| 3 | 225349 | 4.3% |
| 5 | 223549 | 4.2% |
| 4 | 216076 | 4.1% |
| Other values (2) | 184023 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4258556 | |
| Dash Punctuation | 1042086 | 19.7% |
| Other Punctuation | 365 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1031643 | |
| 0 | 807962 | |
| 9 | 611293 | |
| 2 | 400793 | 9.4% |
| 8 | 308847 | 7.3% |
| 6 | 249386 | 5.9% |
| 3 | 225349 | 5.3% |
| 5 | 223549 | 5.2% |
| 4 | 216076 | 5.1% |
| 7 | 183658 | 4.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1042086 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 365 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5301007 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1042086 | |
| 1 | 1031643 | |
| 0 | 807962 | |
| 9 | 611293 | |
| 2 | 400793 | 7.6% |
| 8 | 308847 | 5.8% |
| 6 | 249386 | 4.7% |
| 3 | 225349 | 4.3% |
| 5 | 223549 | 4.2% |
| 4 | 216076 | 4.1% |
| Other values (2) | 184023 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5301007 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1042086 | |
| 1 | 1031643 | |
| 0 | 807962 | |
| 9 | 611293 | |
| 2 | 400793 | 7.6% |
| 8 | 308847 | 5.8% |
| 6 | 249386 | 4.7% |
| 3 | 225349 | 4.3% |
| 5 | 223549 | 4.2% |
| 4 | 216076 | 4.1% |
| Other values (2) | 184023 | 3.5% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 74069 |
| Missing (%) | 12.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.717769816 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 77 |
|---|---|
| 2nd row | 146 |
| 3rd row | 219 |
| 4th row | 99 |
| 5th row | 274 |
| Value | Count | Frequency (%) |
| 140 | 2507 | 0.5% |
| 141 | 2480 | 0.5% |
| 134 | 2428 | 0.5% |
| 135 | 2416 | 0.5% |
| 150 | 2400 | 0.5% |
| 142 | 2384 | 0.5% |
| 136 | 2383 | 0.5% |
| 166 | 2363 | 0.5% |
| 139 | 2355 | 0.5% |
| 132 | 2336 | 0.5% |
| Other values (356) | 486471 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 294060 | |
| 2 | 227022 | |
| 3 | 170842 | |
| 5 | 108195 | 7.8% |
| 4 | 107779 | 7.8% |
| 6 | 104873 | 7.6% |
| 7 | 96252 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92796 | 6.7% |
| 0 | 92372 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1387484 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 294060 | |
| 2 | 227022 | |
| 3 | 170842 | |
| 5 | 108195 | 7.8% |
| 4 | 107779 | 7.8% |
| 6 | 104873 | 7.6% |
| 7 | 96252 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92796 | 6.7% |
| 0 | 92372 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1387484 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 294060 | |
| 2 | 227022 | |
| 3 | 170842 | |
| 5 | 108195 | 7.8% |
| 4 | 107779 | 7.8% |
| 6 | 104873 | 7.6% |
| 7 | 96252 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92796 | 6.7% |
| 0 | 92372 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1387484 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 294060 | |
| 2 | 227022 | |
| 3 | 170842 | |
| 5 | 108195 | 7.8% |
| 4 | 107779 | 7.8% |
| 6 | 104873 | 7.6% |
| 7 | 96252 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92796 | 6.7% |
| 0 | 92372 | 6.7% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 74069 |
| Missing (%) | 12.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.717808992 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 77 |
|---|---|
| 2nd row | 146 |
| 3rd row | 219 |
| 4th row | 99 |
| 5th row | 274 |
| Value | Count | Frequency (%) |
| 140 | 2508 | 0.5% |
| 141 | 2480 | 0.5% |
| 134 | 2427 | 0.5% |
| 135 | 2416 | 0.5% |
| 150 | 2398 | 0.5% |
| 136 | 2383 | 0.5% |
| 142 | 2380 | 0.5% |
| 166 | 2363 | 0.5% |
| 139 | 2354 | 0.5% |
| 132 | 2338 | 0.5% |
| Other values (356) | 486476 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 294066 | |
| 2 | 227032 | |
| 3 | 170858 | |
| 5 | 108162 | 7.8% |
| 4 | 107797 | 7.8% |
| 6 | 104854 | 7.6% |
| 7 | 96249 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92805 | 6.7% |
| 0 | 92388 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1387504 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 294066 | |
| 2 | 227032 | |
| 3 | 170858 | |
| 5 | 108162 | 7.8% |
| 4 | 107797 | 7.8% |
| 6 | 104854 | 7.6% |
| 7 | 96249 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92805 | 6.7% |
| 0 | 92388 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1387504 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 294066 | |
| 2 | 227032 | |
| 3 | 170858 | |
| 5 | 108162 | 7.8% |
| 4 | 107797 | 7.8% |
| 6 | 104854 | 7.6% |
| 7 | 96249 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92805 | 6.7% |
| 0 | 92388 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1387504 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 294066 | |
| 2 | 227032 | |
| 3 | 170858 | |
| 5 | 108162 | 7.8% |
| 4 | 107797 | 7.8% |
| 6 | 104854 | 7.6% |
| 7 | 96249 | 6.9% |
| 8 | 93293 | 6.7% |
| 9 | 92805 | 6.7% |
| 0 | 92388 | 6.7% |
year
Text
Missing 
| Distinct | 204 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 41376 |
| Missing (%) | 7.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1859 |
|---|---|
| 2nd row | 1883 |
| 3rd row | 1895 |
| 4th row | 1924 |
| 5th row | 1987 |
| Value | Count | Frequency (%) |
| 1965 | 14461 | 2.7% |
| 1964 | 13001 | 2.4% |
| 1966 | 10898 | 2.0% |
| 1912 | 9421 | 1.7% |
| 1911 | 8196 | 1.5% |
| 1949 | 8030 | 1.5% |
| 1923 | 7871 | 1.4% |
| 1950 | 6975 | 1.3% |
| 1967 | 6970 | 1.3% |
| 1892 | 6943 | 1.3% |
| Other values (194) | 450450 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 642014 | |
| 9 | 524891 | |
| 8 | 217754 | 10.0% |
| 6 | 138209 | 6.4% |
| 0 | 135629 | 6.2% |
| 2 | 113128 | 5.2% |
| 4 | 110538 | 5.1% |
| 5 | 102840 | 4.7% |
| 3 | 100862 | 4.6% |
| 7 | 86999 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2172864 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 642014 | |
| 9 | 524891 | |
| 8 | 217754 | 10.0% |
| 6 | 138209 | 6.4% |
| 0 | 135629 | 6.2% |
| 2 | 113128 | 5.2% |
| 4 | 110538 | 5.1% |
| 5 | 102840 | 4.7% |
| 3 | 100862 | 4.6% |
| 7 | 86999 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2172864 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 642014 | |
| 9 | 524891 | |
| 8 | 217754 | 10.0% |
| 6 | 138209 | 6.4% |
| 0 | 135629 | 6.2% |
| 2 | 113128 | 5.2% |
| 4 | 110538 | 5.1% |
| 5 | 102840 | 4.7% |
| 3 | 100862 | 4.6% |
| 7 | 86999 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2172864 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 642014 | |
| 9 | 524891 | |
| 8 | 217754 | 10.0% |
| 6 | 138209 | 6.4% |
| 0 | 135629 | 6.2% |
| 2 | 113128 | 5.2% |
| 4 | 110538 | 5.1% |
| 5 | 102840 | 4.7% |
| 3 | 100862 | 4.6% |
| 7 | 86999 | 4.0% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 53877 |
| Missing (%) | 9.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.178898279 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 3 |
| 3rd row | 5 |
| 4th row | 8 |
| 5th row | 4 |
| Value | Count | Frequency (%) |
| 5 | 70341 | |
| 6 | 61173 | |
| 4 | 54173 | |
| 3 | 50525 | |
| 7 | 46973 | |
| 2 | 40464 | |
| 8 | 39913 | |
| 9 | 37742 | |
| 10 | 35465 | |
| 1 | 34467 | |
| Other values (2) | 59479 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 160144 | |
| 5 | 70341 | |
| 2 | 69210 | |
| 6 | 61173 | 9.8% |
| 4 | 54173 | 8.7% |
| 3 | 50525 | 8.1% |
| 7 | 46973 | 7.5% |
| 8 | 39913 | 6.4% |
| 9 | 37742 | 6.0% |
| 0 | 35465 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 625659 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 160144 | |
| 5 | 70341 | |
| 2 | 69210 | |
| 6 | 61173 | 9.8% |
| 4 | 54173 | 8.7% |
| 3 | 50525 | 8.1% |
| 7 | 46973 | 7.5% |
| 8 | 39913 | 6.4% |
| 9 | 37742 | 6.0% |
| 0 | 35465 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 625659 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 160144 | |
| 5 | 70341 | |
| 2 | 69210 | |
| 6 | 61173 | 9.8% |
| 4 | 54173 | 8.7% |
| 3 | 50525 | 8.1% |
| 7 | 46973 | 7.5% |
| 8 | 39913 | 6.4% |
| 9 | 37742 | 6.0% |
| 0 | 35465 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 625659 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 160144 | |
| 5 | 70341 | |
| 2 | 69210 | |
| 6 | 61173 | 9.8% |
| 4 | 54173 | 8.7% |
| 3 | 50525 | 8.1% |
| 7 | 46973 | 7.5% |
| 8 | 39913 | 6.4% |
| 9 | 37742 | 6.0% |
| 0 | 35465 | 5.7% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 74434 |
| Missing (%) | 12.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.707414174 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 18 |
|---|---|
| 2nd row | 26 |
| 3rd row | 6 |
| 4th row | 9 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 20 | 17976 | 3.5% |
| 10 | 17940 | 3.5% |
| 8 | 17679 | 3.5% |
| 15 | 17667 | 3.5% |
| 21 | 17460 | 3.4% |
| 12 | 17459 | 3.4% |
| 24 | 17311 | 3.4% |
| 22 | 17146 | 3.4% |
| 4 | 17141 | 3.4% |
| 16 | 17122 | 3.4% |
| Other values (21) | 335257 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 228583 | |
| 2 | 217994 | |
| 3 | 73728 | 8.5% |
| 4 | 51207 | 5.9% |
| 8 | 50991 | 5.9% |
| 0 | 50799 | 5.8% |
| 5 | 50165 | 5.8% |
| 6 | 49818 | 5.7% |
| 7 | 49449 | 5.7% |
| 9 | 48317 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 871051 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 228583 | |
| 2 | 217994 | |
| 3 | 73728 | 8.5% |
| 4 | 51207 | 5.9% |
| 8 | 50991 | 5.9% |
| 0 | 50799 | 5.8% |
| 5 | 50165 | 5.8% |
| 6 | 49818 | 5.7% |
| 7 | 49449 | 5.7% |
| 9 | 48317 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 871051 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 228583 | |
| 2 | 217994 | |
| 3 | 73728 | 8.5% |
| 4 | 51207 | 5.9% |
| 8 | 50991 | 5.9% |
| 0 | 50799 | 5.8% |
| 5 | 50165 | 5.8% |
| 6 | 49818 | 5.7% |
| 7 | 49449 | 5.7% |
| 9 | 48317 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 871051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 228583 | |
| 2 | 217994 | |
| 3 | 73728 | 8.5% |
| 4 | 51207 | 5.9% |
| 8 | 50991 | 5.9% |
| 0 | 50799 | 5.8% |
| 5 | 50165 | 5.8% |
| 6 | 49818 | 5.7% |
| 7 | 49449 | 5.7% |
| 9 | 48317 | 5.5% |
Missing 
| Distinct | 43994 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 235442 |
| Missing (%) | 40.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 11 |
| Mean length | 10.64060719 |
| Min length | 1 |
Unique
| Unique | 10311 ? |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | -- May 1859 |
|---|---|
| 2nd row | 18 Mar 1883 |
| 3rd row | 26 May 1895 |
| 4th row | 6 Aug 1924 |
| 5th row | 9 Apr 1987 |
| Value | Count | Frequency (%) |
| 149965 | 14.3% | |
| may | 43235 | 4.1% |
| jun | 37603 | 3.6% |
| apr | 31571 | 3.0% |
| mar | 27292 | 2.6% |
| jul | 27206 | 2.6% |
| aug | 23700 | 2.3% |
| feb | 21866 | 2.1% |
| sep | 21167 | 2.0% |
| jan | 18181 | 1.7% |
| Other values (727) | 644585 |
Most occurring characters
| Value | Count | Frequency (%) |
| 697221 | ||
| 1 | 503447 | |
| - | 381992 | 10.3% |
| 9 | 327404 | 8.8% |
| 2 | 174483 | 4.7% |
| 8 | 174195 | 4.7% |
| 6 | 106883 | 2.9% |
| 3 | 99628 | 2.7% |
| 4 | 93965 | 2.5% |
| a | 89421 | 2.4% |
| Other values (67) | 1066529 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1729072 | |
| Space Separator | 697221 | |
| Lowercase Letter | 605710 | 16.3% |
| Dash Punctuation | 381992 | 10.3% |
| Uppercase Letter | 300576 | 8.1% |
| Other Punctuation | 550 | < 0.1% |
| Close Punctuation | 16 | < 0.1% |
| Open Punctuation | 16 | < 0.1% |
| Math Symbol | 8 | < 0.1% |
| Format | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 89421 | |
| u | 89025 | |
| r | 60311 | |
| e | 56954 | |
| n | 56861 | |
| p | 53345 | |
| y | 43351 | |
| c | 31070 | 5.1% |
| l | 28240 | 4.7% |
| g | 24158 | 4.0% |
| Other values (14) | 72974 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 83206 | |
| M | 70639 | |
| A | 55418 | |
| F | 22373 | 7.4% |
| S | 21945 | 7.3% |
| O | 18184 | 6.0% |
| N | 15173 | 5.0% |
| D | 12829 | 4.3% |
| W | 412 | 0.1% |
| I | 175 | 0.1% |
| Other values (14) | 222 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 503447 | |
| 9 | 327404 | |
| 2 | 174483 | 10.1% |
| 8 | 174195 | 10.1% |
| 6 | 106883 | 6.2% |
| 3 | 99628 | 5.8% |
| 4 | 93965 | 5.4% |
| 0 | 87653 | 5.1% |
| 5 | 82545 | 4.8% |
| 7 | 78869 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 176 | |
| . | 144 | |
| , | 89 | |
| ? | 49 | 8.9% |
| ' | 34 | 6.2% |
| : | 32 | 5.8% |
| & | 12 | 2.2% |
| \ | 11 | 2.0% |
| " | 2 | 0.4% |
| # | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 8 | |
| ) | 8 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 | |
| [ | 8 |
Space Separator
| Value | Count | Frequency (%) |
| 697221 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 381992 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 8 |
Format
| Value | Count | Frequency (%) |
| | 4 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2808882 | |
| Latin | 906286 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 89421 | 9.9% |
| u | 89025 | 9.8% |
| J | 83206 | 9.2% |
| M | 70639 | 7.8% |
| r | 60311 | 6.7% |
| e | 56954 | 6.3% |
| n | 56861 | 6.3% |
| A | 55418 | 6.1% |
| p | 53345 | 5.9% |
| y | 43351 | 4.8% |
| Other values (38) | 247755 |
Common
| Value | Count | Frequency (%) |
| 697221 | ||
| 1 | 503447 | |
| - | 381992 | |
| 9 | 327404 | |
| 2 | 174483 | 6.2% |
| 8 | 174195 | 6.2% |
| 6 | 106883 | 3.8% |
| 3 | 99628 | 3.5% |
| 4 | 93965 | 3.3% |
| 0 | 87653 | 3.1% |
| Other values (19) | 162011 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3715164 | |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 697221 | ||
| 1 | 503447 | |
| - | 381992 | 10.3% |
| 9 | 327404 | 8.8% |
| 2 | 174483 | 4.7% |
| 8 | 174195 | 4.7% |
| 6 | 106883 | 2.9% |
| 3 | 99628 | 2.7% |
| 4 | 93965 | 2.5% |
| a | 89421 | 2.4% |
| Other values (66) | 1066525 |
Punctuation
| Value | Count | Frequency (%) |
| | 4 |
habitat
Text
Missing 
| Distinct | 4924 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 567355 |
| Missing (%) | 97.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 191 |
|---|---|
| Median length | 141 |
| Mean length | 27.13418808 |
| Min length | 3 |
Unique
| Unique | 3478 ? |
|---|---|
| Unique (%) | 20.2% |
Sample
| 1st row | IN OPEN OCEAN AT 0835 |
|---|---|
| 2nd row | dense marshy grass |
| 3rd row | Along lake shore, water and dead brush |
| 4th row | airport |
| 5th row | montane forest edge |
| Value | Count | Frequency (%) |
| forest | 6854 | 9.3% |
| with | 2343 | 3.2% |
| open | 1915 | 2.6% |
| of | 1628 | 2.2% |
| in | 1549 | 2.1% |
| and | 1461 | 2.0% |
| scrub | 1279 | 1.7% |
| edge | 1213 | 1.6% |
| 945 | 1.3% | |
| on | 919 | 1.2% |
| Other values (2526) | 53491 |
Most occurring characters
| Value | Count | Frequency (%) |
| 56360 | 12.1% | |
| e | 41846 | 8.9% |
| o | 33590 | 7.2% |
| a | 33285 | 7.1% |
| s | 31715 | 6.8% |
| r | 31696 | 6.8% |
| t | 25427 | 5.4% |
| n | 24905 | 5.3% |
| i | 21550 | 4.6% |
| l | 17791 | 3.8% |
| Other values (72) | 149547 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 376043 | |
| Space Separator | 56360 | 12.1% |
| Uppercase Letter | 25209 | 5.4% |
| Other Punctuation | 6419 | 1.4% |
| Dash Punctuation | 1495 | 0.3% |
| Decimal Number | 1436 | 0.3% |
| Open Punctuation | 365 | 0.1% |
| Close Punctuation | 365 | 0.1% |
| Math Symbol | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 41846 | |
| o | 33590 | 8.9% |
| a | 33285 | 8.9% |
| s | 31715 | 8.4% |
| r | 31696 | 8.4% |
| t | 25427 | 6.8% |
| n | 24905 | 6.6% |
| i | 21550 | 5.7% |
| l | 17791 | 4.7% |
| d | 17630 | 4.7% |
| Other values (16) | 96608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2706 | 10.7% |
| E | 2400 | 9.5% |
| R | 2179 | 8.6% |
| A | 2016 | 8.0% |
| N | 1705 | 6.8% |
| S | 1663 | 6.6% |
| L | 1499 | 5.9% |
| I | 1493 | 5.9% |
| T | 1420 | 5.6% |
| C | 1283 | 5.1% |
| Other values (16) | 6845 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4306 | |
| . | 571 | 8.9% |
| ; | 545 | 8.5% |
| & | 456 | 7.1% |
| / | 439 | 6.8% |
| " | 28 | 0.4% |
| : | 27 | 0.4% |
| ' | 25 | 0.4% |
| ? | 15 | 0.2% |
| # | 4 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 592 | |
| 5 | 303 | |
| 1 | 155 | 10.8% |
| 2 | 135 | 9.4% |
| 3 | 126 | 8.8% |
| 4 | 53 | 3.7% |
| 6 | 27 | 1.9% |
| 8 | 17 | 1.2% |
| 7 | 17 | 1.2% |
| 9 | 11 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 8 | |
| < | 7 | |
| = | 5 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 350 | |
| [ | 15 | 4.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 350 | |
| ] | 15 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| 56360 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1495 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 401252 | |
| Common | 66460 | 14.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 41846 | 10.4% |
| o | 33590 | 8.4% |
| a | 33285 | 8.3% |
| s | 31715 | 7.9% |
| r | 31696 | 7.9% |
| t | 25427 | 6.3% |
| n | 24905 | 6.2% |
| i | 21550 | 5.4% |
| l | 17791 | 4.4% |
| d | 17630 | 4.4% |
| Other values (42) | 121817 |
Common
| Value | Count | Frequency (%) |
| 56360 | ||
| , | 4306 | 6.5% |
| - | 1495 | 2.2% |
| 0 | 592 | 0.9% |
| . | 571 | 0.9% |
| ; | 545 | 0.8% |
| & | 456 | 0.7% |
| / | 439 | 0.7% |
| ( | 350 | 0.5% |
| ) | 350 | 0.5% |
| Other values (20) | 996 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 467712 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 56360 | 12.1% | |
| e | 41846 | 8.9% |
| o | 33590 | 7.2% |
| a | 33285 | 7.1% |
| s | 31715 | 6.8% |
| r | 31696 | 6.8% |
| t | 25427 | 5.4% |
| n | 24905 | 5.3% |
| i | 21550 | 4.6% |
| l | 17791 | 3.8% |
| Other values (72) | 149547 |
higherGeography
Text
| Distinct | 6798 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 95 |
|---|---|
| Median length | 75 |
| Mean length | 36.76763373 |
| Min length | 4 |
Unique
| Unique | 1458 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | South America, Paraguay, Asuncion |
|---|---|
| 2nd row | North America, United States, Florida |
| 3rd row | North America, United States, South Dakota |
| 4th row | North America, United States, Maine |
| 5th row | Asia, Philippines, Palawan, Palawan Province |
| Value | Count | Frequency (%) |
| america | 389870 | 13.5% |
| north | 349097 | 12.1% |
| united | 213165 | 7.4% |
| states | 211488 | 7.4% |
| asia | 94981 | 3.3% |
| south | 88499 | 3.1% |
| africa | 52986 | 1.8% |
| mexico | 32547 | 1.1% |
| panama | 31800 | 1.1% |
| colombia | 28517 | 1.0% |
| Other values (4797) | 1384325 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2292685 | 10.7% | |
| a | 2269264 | 10.6% |
| i | 1576409 | 7.3% |
| e | 1449846 | 6.7% |
| t | 1415514 | 6.6% |
| r | 1302972 | 6.1% |
| , | 1293349 | 6.0% |
| o | 1083406 | 5.0% |
| n | 1034429 | 4.8% |
| s | 708939 | 3.3% |
| Other values (64) | 7067178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14996132 | |
| Uppercase Letter | 2873711 | 13.4% |
| Space Separator | 2292685 | 10.7% |
| Other Punctuation | 1312996 | 6.1% |
| Dash Punctuation | 16199 | 0.1% |
| Open Punctuation | 1132 | < 0.1% |
| Close Punctuation | 1131 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2269264 | |
| i | 1576409 | |
| e | 1449846 | |
| t | 1415514 | |
| r | 1302972 | |
| o | 1083406 | 7.2% |
| n | 1034429 | 6.9% |
| s | 708939 | 4.7% |
| c | 702577 | 4.7% |
| h | 651096 | 4.3% |
| Other values (19) | 2801680 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 655752 | |
| N | 422462 | |
| S | 371915 | |
| U | 235640 | 8.2% |
| C | 213796 | 7.4% |
| M | 131471 | 4.6% |
| P | 129994 | 4.5% |
| I | 77028 | 2.7% |
| B | 69307 | 2.4% |
| T | 68706 | 2.4% |
| Other values (16) | 497640 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1293349 | |
| ' | 6575 | 0.5% |
| . | 5181 | 0.4% |
| ? | 4098 | 0.3% |
| / | 3790 | 0.3% |
| & | 2 | < 0.1% |
| \ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 8 | 1 | |
| 6 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16138 | |
| – | 61 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1125 | |
| [ | 7 | 0.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1124 | |
| ] | 7 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 | |
| ~ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2292685 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17869843 | |
| Common | 3624148 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2269264 | |
| i | 1576409 | 8.8% |
| e | 1449846 | 8.1% |
| t | 1415514 | 7.9% |
| r | 1302972 | 7.3% |
| o | 1083406 | 6.1% |
| n | 1034429 | 5.8% |
| s | 708939 | 4.0% |
| c | 702577 | 3.9% |
| A | 655752 | 3.7% |
| Other values (45) | 5670735 |
Common
| Value | Count | Frequency (%) |
| 2292685 | ||
| , | 1293349 | |
| - | 16138 | 0.4% |
| ' | 6575 | 0.2% |
| . | 5181 | 0.1% |
| ? | 4098 | 0.1% |
| / | 3790 | 0.1% |
| ( | 1125 | < 0.1% |
| ) | 1124 | < 0.1% |
| – | 61 | < 0.1% |
| Other values (9) | 22 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21493923 | |
| Punctuation | 61 | < 0.1% |
| None | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2292685 | 10.7% | |
| a | 2269264 | 10.6% |
| i | 1576409 | 7.3% |
| e | 1449846 | 6.7% |
| t | 1415514 | 6.6% |
| r | 1302972 | 6.1% |
| , | 1293349 | 6.0% |
| o | 1083406 | 5.0% |
| n | 1034429 | 4.8% |
| s | 708939 | 3.3% |
| Other values (60) | 7067110 |
Punctuation
| Value | Count | Frequency (%) |
| – | 61 |
None
| Value | Count | Frequency (%) |
| ô | 4 | |
| é | 2 | |
| ä | 1 | 14.3% |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 27500 |
| Missing (%) | 4.7% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.59729093 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SOUTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| north_america | 322157 | |
| asia | 96833 | 17.4% |
| south_america | 69099 | 12.4% |
| africa | 47406 | 8.5% |
| oceania | 11848 | 2.1% |
| europe | 8714 | 1.6% |
| antarctica | 1035 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1097791 | |
| R | 770568 | |
| I | 548378 | |
| C | 452580 | |
| E | 420532 | 7.1% |
| O | 411818 | 7.0% |
| T | 393326 | 6.7% |
| H | 391256 | 6.6% |
| _ | 391256 | 6.6% |
| M | 391256 | 6.6% |
| Other values (5) | 634905 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5512410 | |
| Connector Punctuation | 391256 | 6.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1097791 | |
| R | 770568 | |
| I | 548378 | |
| C | 452580 | |
| E | 420532 | 7.6% |
| O | 411818 | 7.5% |
| T | 393326 | 7.1% |
| H | 391256 | 7.1% |
| M | 391256 | 7.1% |
| N | 335040 | 6.1% |
| Other values (4) | 299865 | 5.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 391256 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5512410 | |
| Common | 391256 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1097791 | |
| R | 770568 | |
| I | 548378 | |
| C | 452580 | |
| E | 420532 | 7.6% |
| O | 411818 | 7.5% |
| T | 393326 | 7.1% |
| H | 391256 | 7.1% |
| M | 391256 | 7.1% |
| N | 335040 | 6.1% |
| Other values (4) | 299865 | 5.4% |
Common
| Value | Count | Frequency (%) |
| _ | 391256 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5903666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1097791 | |
| R | 770568 | |
| I | 548378 | |
| C | 452580 | |
| E | 420532 | 7.1% |
| O | 411818 | 7.0% |
| T | 393326 | 6.7% |
| H | 391256 | 6.6% |
| _ | 391256 | 6.6% |
| M | 391256 | 6.6% |
| Other values (5) | 634905 |
waterBody
Text
Missing 
| Distinct | 67 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 558515 |
| Missing (%) | 95.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 55 |
|---|---|
| Median length | 19 |
| Mean length | 20.14311462 |
| Min length | 8 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Arctic Ocean |
|---|---|
| 2nd row | North Pacific Ocean |
| 3rd row | North Pacific Ocean |
| 4th row | North Pacific Ocean |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 26055 | |
| pacific | 19043 | |
| north | 16048 | |
| south | 6719 | 8.3% |
| atlantic | 4113 | 5.1% |
| indian | 2690 | 3.3% |
| sea | 2523 | 3.1% |
| mediterranean | 1992 | 2.5% |
| weddell | 131 | 0.2% |
| arctic | 125 | 0.2% |
| Other values (57) | 1126 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 68650 | |
| a | 59282 | |
| 54488 | ||
| i | 47442 | |
| n | 40099 | 7.6% |
| e | 35322 | 6.7% |
| t | 33362 | 6.4% |
| O | 26120 | 5.0% |
| o | 23090 | 4.4% |
| h | 23023 | 4.4% |
| Other values (35) | 114394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 387593 | |
| Uppercase Letter | 80498 | 15.3% |
| Space Separator | 54488 | 10.4% |
| Other Punctuation | 2693 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 68650 | |
| a | 59282 | |
| i | 47442 | |
| n | 40099 | |
| e | 35322 | |
| t | 33362 | |
| o | 23090 | 6.0% |
| h | 23023 | 5.9% |
| r | 20677 | 5.3% |
| f | 19227 | 5.0% |
| Other values (14) | 17419 | 4.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 26120 | |
| P | 19099 | |
| N | 16052 | |
| S | 9360 | 11.6% |
| A | 4242 | 5.3% |
| I | 2690 | 3.3% |
| M | 1995 | 2.5% |
| B | 240 | 0.3% |
| C | 217 | 0.3% |
| W | 158 | 0.2% |
| Other values (8) | 325 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2671 | |
| ? | 22 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 54488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 468091 | |
| Common | 57181 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 68650 | |
| a | 59282 | |
| i | 47442 | |
| n | 40099 | |
| e | 35322 | 7.5% |
| t | 33362 | 7.1% |
| O | 26120 | 5.6% |
| o | 23090 | 4.9% |
| h | 23023 | 4.9% |
| r | 20677 | 4.4% |
| Other values (32) | 91024 |
Common
| Value | Count | Frequency (%) |
| 54488 | ||
| , | 2671 | 4.7% |
| ? | 22 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 525272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 68650 | |
| a | 59282 | |
| 54488 | ||
| i | 47442 | |
| n | 40099 | 7.6% |
| e | 35322 | 6.7% |
| t | 33362 | 6.4% |
| O | 26120 | 5.0% |
| o | 23090 | 4.4% |
| h | 23023 | 4.4% |
| Other values (35) | 114394 |
countryCode
Text
| Distinct | 216 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3736 |
| Missing (%) | 0.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PY |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | PH |
| Value | Count | Frequency (%) |
| us | 216836 | |
| co | 28553 | 4.9% |
| mx | 28229 | 4.9% |
| pa | 27171 | 4.7% |
| ca | 17452 | 3.0% |
| th | 17424 | 3.0% |
| ph | 16446 | 2.8% |
| zz | 16268 | 2.8% |
| cn | 14054 | 2.4% |
| id | 13339 | 2.3% |
| Other values (206) | 185084 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 233998 | |
| S | 224421 | |
| C | 85053 | 7.3% |
| A | 67721 | 5.8% |
| P | 59989 | 5.2% |
| M | 46638 | 4.0% |
| Z | 46463 | 4.0% |
| T | 40681 | 3.5% |
| E | 39938 | 3.4% |
| H | 39416 | 3.4% |
| Other values (16) | 277394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1161712 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 233998 | |
| S | 224421 | |
| C | 85053 | 7.3% |
| A | 67721 | 5.8% |
| P | 59989 | 5.2% |
| M | 46638 | 4.0% |
| Z | 46463 | 4.0% |
| T | 40681 | 3.5% |
| E | 39938 | 3.4% |
| H | 39416 | 3.4% |
| Other values (16) | 277394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1161712 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 233998 | |
| S | 224421 | |
| C | 85053 | 7.3% |
| A | 67721 | 5.8% |
| P | 59989 | 5.2% |
| M | 46638 | 4.0% |
| Z | 46463 | 4.0% |
| T | 40681 | 3.5% |
| E | 39938 | 3.4% |
| H | 39416 | 3.4% |
| Other values (16) | 277394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1161712 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 233998 | |
| S | 224421 | |
| C | 85053 | 7.3% |
| A | 67721 | 5.8% |
| P | 59989 | 5.2% |
| M | 46638 | 4.0% |
| Z | 46463 | 4.0% |
| T | 40681 | 3.5% |
| E | 39938 | 3.4% |
| H | 39416 | 3.4% |
| Other values (16) | 277394 |
stateProvince
Text
Missing 
| Distinct | 2242 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 93871 |
| Missing (%) | 16.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 71 |
|---|---|
| Median length | 40 |
| Mean length | 9.131608388 |
| Min length | 3 |
Unique
| Unique | 420 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Asuncion |
|---|---|
| 2nd row | Florida |
| 3rd row | South Dakota |
| 4th row | Maine |
| 5th row | Palawan |
| Value | Count | Frequency (%) |
| california | 23409 | 3.6% |
| new | 20454 | 3.1% |
| alaska | 19385 | 3.0% |
| virginia | 14953 | 2.3% |
| arizona | 13147 | 2.0% |
| maryland | 10719 | 1.6% |
| florida | 10644 | 1.6% |
| texas | 9775 | 1.5% |
| columbia | 9291 | 1.4% |
| island | 9097 | 1.4% |
| Other values (2044) | 512747 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 688102 | |
| i | 363250 | 8.1% |
| n | 330347 | 7.4% |
| o | 310192 | 6.9% |
| r | 284632 | 6.4% |
| e | 240206 | 5.4% |
| l | 198665 | 4.4% |
| s | 197499 | 4.4% |
| 162900 | 3.6% | |
| t | 158835 | 3.5% |
| Other values (57) | 1546444 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3642328 | |
| Uppercase Letter | 655428 | 14.6% |
| Space Separator | 162900 | 3.6% |
| Dash Punctuation | 12832 | 0.3% |
| Other Punctuation | 7148 | 0.2% |
| Open Punctuation | 216 | < 0.1% |
| Close Punctuation | 216 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 688102 | |
| i | 363250 | |
| n | 330347 | |
| o | 310192 | |
| r | 284632 | 7.8% |
| e | 240206 | 6.6% |
| l | 198665 | 5.5% |
| s | 197499 | 5.4% |
| t | 158835 | 4.4% |
| u | 137454 | 3.8% |
| Other values (18) | 733146 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 87330 | |
| M | 61534 | 9.4% |
| A | 60288 | 9.2% |
| N | 58797 | 9.0% |
| S | 40319 | 6.2% |
| T | 35065 | 5.3% |
| I | 30921 | 4.7% |
| P | 30209 | 4.6% |
| D | 27952 | 4.3% |
| B | 25816 | 3.9% |
| Other values (16) | 197197 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3008 | |
| ? | 1713 | |
| / | 1358 | |
| . | 901 | 12.6% |
| , | 168 | 2.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 8 | 1 | |
| 6 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 162900 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12832 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 216 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 216 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4297756 | |
| Common | 183316 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 688102 | |
| i | 363250 | 8.5% |
| n | 330347 | 7.7% |
| o | 310192 | 7.2% |
| r | 284632 | 6.6% |
| e | 240206 | 5.6% |
| l | 198665 | 4.6% |
| s | 197499 | 4.6% |
| t | 158835 | 3.7% |
| u | 137454 | 3.2% |
| Other values (44) | 1388574 |
Common
| Value | Count | Frequency (%) |
| 162900 | ||
| - | 12832 | 7.0% |
| ' | 3008 | 1.6% |
| ? | 1713 | 0.9% |
| / | 1358 | 0.7% |
| . | 901 | 0.5% |
| ( | 216 | 0.1% |
| ) | 216 | 0.1% |
| , | 168 | 0.1% |
| + | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4481070 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 688102 | |
| i | 363250 | 8.1% |
| n | 330347 | 7.4% |
| o | 310192 | 6.9% |
| r | 284632 | 6.4% |
| e | 240206 | 5.4% |
| l | 198665 | 4.4% |
| s | 197499 | 4.4% |
| 162900 | 3.6% | |
| t | 158835 | 3.5% |
| Other values (55) | 1546442 |
None
| Value | Count | Frequency (%) |
| ô | 1 | |
| é | 1 |
county
Text
Missing 
| Distinct | 3216 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 353572 |
| Missing (%) | 60.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 31 |
| Mean length | 9.707878106 |
| Min length | 1 |
Unique
| Unique | 641 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Palawan Province |
|---|---|
| 2nd row | Bergen |
| 3rd row | North Solomons Province |
| 4th row | Clarke |
| 5th row | Augusta |
| Value | Count | Frequency (%) |
| area | 7116 | 2.1% |
| census | 7108 | 2.1% |
| province | 5993 | 1.8% |
| bergen | 4929 | 1.5% |
| aleutians | 4466 | 1.3% |
| county | 4430 | 1.3% |
| west | 4293 | 1.3% |
| borough | 3777 | 1.1% |
| san | 3628 | 1.1% |
| latah | 3591 | 1.1% |
| Other values (2933) | 289412 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 244424 | 10.9% |
| e | 199313 | 8.9% |
| n | 165552 | 7.4% |
| o | 159648 | 7.1% |
| r | 146489 | 6.5% |
| i | 116092 | 5.2% |
| 107723 | 4.8% | |
| t | 103825 | 4.6% |
| s | 98320 | 4.4% |
| l | 98134 | 4.4% |
| Other values (59) | 803194 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1783043 | |
| Uppercase Letter | 339922 | 15.2% |
| Space Separator | 107723 | 4.8% |
| Other Punctuation | 7691 | 0.3% |
| Dash Punctuation | 3359 | 0.1% |
| Open Punctuation | 488 | < 0.1% |
| Close Punctuation | 487 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 244424 | |
| e | 199313 | |
| n | 165552 | |
| o | 159648 | |
| r | 146489 | 8.2% |
| i | 116092 | 6.5% |
| t | 103825 | 5.8% |
| s | 98320 | 5.5% |
| l | 98134 | 5.5% |
| u | 80396 | 4.5% |
| Other values (19) | 370850 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 46264 | |
| S | 28692 | 8.4% |
| A | 28447 | 8.4% |
| M | 26585 | 7.8% |
| B | 26532 | 7.8% |
| P | 24905 | 7.3% |
| D | 17542 | 5.2% |
| L | 16783 | 4.9% |
| N | 15007 | 4.4% |
| H | 14626 | 4.3% |
| Other values (16) | 94539 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3565 | |
| / | 2007 | |
| . | 1654 | |
| ? | 462 | 6.0% |
| & | 2 | < 0.1% |
| , | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3298 | |
| – | 61 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 481 | |
| [ | 7 | 1.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 480 | |
| ] | 7 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 107723 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2122965 | |
| Common | 119749 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 244424 | 11.5% |
| e | 199313 | 9.4% |
| n | 165552 | 7.8% |
| o | 159648 | 7.5% |
| r | 146489 | 6.9% |
| i | 116092 | 5.5% |
| t | 103825 | 4.9% |
| s | 98320 | 4.6% |
| l | 98134 | 4.6% |
| u | 80396 | 3.8% |
| Other values (45) | 710772 |
Common
| Value | Count | Frequency (%) |
| 107723 | ||
| ' | 3565 | 3.0% |
| - | 3298 | 2.8% |
| / | 2007 | 1.7% |
| . | 1654 | 1.4% |
| ( | 481 | 0.4% |
| ) | 480 | 0.4% |
| ? | 462 | 0.4% |
| – | 61 | 0.1% |
| [ | 7 | < 0.1% |
| Other values (4) | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2242650 | |
| Punctuation | 61 | < 0.1% |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 244424 | 10.9% |
| e | 199313 | 8.9% |
| n | 165552 | 7.4% |
| o | 159648 | 7.1% |
| r | 146489 | 6.5% |
| i | 116092 | 5.2% |
| 107723 | 4.8% | |
| t | 103825 | 4.6% |
| s | 98320 | 4.4% |
| l | 98134 | 4.4% |
| Other values (55) | 803130 |
Punctuation
| Value | Count | Frequency (%) |
| – | 61 |
None
| Value | Count | Frequency (%) |
| ô | 1 | |
| é | 1 | |
| ä | 1 |
locality
Text
Missing 
| Distinct | 64255 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 107551 |
| Missing (%) | 18.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 929 |
|---|---|
| Median length | 128 |
| Mean length | 17.88850853 |
| Min length | 1 |
Unique
| Unique | 33921 ? |
|---|---|
| Unique (%) | 7.1% |
Sample
| 1st row | Asuncion |
|---|---|
| 2nd row | Bryant, Near |
| 3rd row | Owl'S Head |
| 4th row | Nali Barrio, Dam Site, Quezon Municipality |
| 5th row | Fort Lee |
| Value | Count | Frequency (%) |
| island | 33520 | 2.4% |
| mi | 31811 | 2.3% |
| of | 23110 | 1.6% |
| river | 22675 | 1.6% |
| rio | 21864 | 1.6% |
| km | 18525 | 1.3% |
| fort | 14257 | 1.0% |
| san | 13196 | 0.9% |
| near | 13030 | 0.9% |
| lake | 11919 | 0.8% |
| Other values (33466) | 1203009 |
Most occurring characters
| Value | Count | Frequency (%) |
| 929876 | 10.9% | |
| a | 913886 | 10.7% |
| e | 542145 | 6.4% |
| o | 539938 | 6.3% |
| n | 524605 | 6.1% |
| i | 502610 | 5.9% |
| r | 415688 | 4.9% |
| l | 354250 | 4.2% |
| t | 336164 | 3.9% |
| s | 280759 | 3.3% |
| Other values (102) | 3193631 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5940218 | |
| Uppercase Letter | 1267138 | 14.8% |
| Space Separator | 929876 | 10.9% |
| Other Punctuation | 265001 | 3.1% |
| Decimal Number | 105220 | 1.2% |
| Dash Punctuation | 11355 | 0.1% |
| Open Punctuation | 5161 | 0.1% |
| Close Punctuation | 5158 | 0.1% |
| Math Symbol | 4354 | 0.1% |
| Connector Punctuation | 44 | < 0.1% |
| Other values (2) | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 913886 | |
| e | 542145 | |
| o | 539938 | |
| n | 524605 | |
| i | 502610 | 8.5% |
| r | 415688 | 7.0% |
| l | 354250 | 6.0% |
| t | 336164 | 5.7% |
| s | 280759 | 4.7% |
| u | 248246 | 4.2% |
| Other values (36) | 1281927 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 138099 | 10.9% |
| C | 105764 | 8.3% |
| M | 93638 | 7.4% |
| B | 86701 | 6.8% |
| P | 86223 | 6.8% |
| R | 83609 | 6.6% |
| L | 71717 | 5.7% |
| N | 65417 | 5.2% |
| I | 56763 | 4.5% |
| A | 52124 | 4.1% |
| Other values (17) | 427083 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 231908 | |
| . | 22167 | 8.4% |
| ' | 6055 | 2.3% |
| ? | 1337 | 0.5% |
| / | 958 | 0.4% |
| " | 816 | 0.3% |
| : | 659 | 0.2% |
| # | 444 | 0.2% |
| & | 364 | 0.1% |
| ; | 283 | 0.1% |
| Other values (3) | 10 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 23097 | |
| 5 | 17642 | |
| 2 | 14487 | |
| 0 | 13388 | |
| 3 | 8220 | 7.8% |
| 4 | 6775 | 6.4% |
| 8 | 6256 | 5.9% |
| 7 | 5786 | 5.5% |
| 6 | 5236 | 5.0% |
| 9 | 4333 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 4291 | |
| + | 56 | 1.3% |
| ~ | 7 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3168 | |
| [ | 1992 | |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3167 | |
| ] | 1990 | |
| } | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11354 | |
| – | 1 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 12 | |
| › | 3 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 929876 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 44 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7207356 | |
| Common | 1326196 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 913886 | 12.7% |
| e | 542145 | 7.5% |
| o | 539938 | 7.5% |
| n | 524605 | 7.3% |
| i | 502610 | 7.0% |
| r | 415688 | 5.8% |
| l | 354250 | 4.9% |
| t | 336164 | 4.7% |
| s | 280759 | 3.9% |
| u | 248246 | 3.4% |
| Other values (63) | 2549065 |
Common
| Value | Count | Frequency (%) |
| 929876 | ||
| , | 231908 | 17.5% |
| 1 | 23097 | 1.7% |
| . | 22167 | 1.7% |
| 5 | 17642 | 1.3% |
| 2 | 14487 | 1.1% |
| 0 | 13388 | 1.0% |
| - | 11354 | 0.9% |
| 3 | 8220 | 0.6% |
| 4 | 6775 | 0.5% |
| Other values (29) | 47282 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8533180 | |
| None | 344 | < 0.1% |
| Punctuation | 28 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 929876 | 10.9% | |
| a | 913886 | 10.7% |
| e | 542145 | 6.4% |
| o | 539938 | 6.3% |
| n | 524605 | 6.1% |
| i | 502610 | 5.9% |
| r | 415688 | 4.9% |
| l | 354250 | 4.2% |
| t | 336164 | 3.9% |
| s | 280759 | 3.3% |
| Other values (77) | 3193259 |
None
| Value | Count | Frequency (%) |
| ñ | 80 | |
| ô | 69 | |
| á | 58 | |
| í | 35 | |
| ā | 21 | 6.1% |
| é | 17 | 4.9% |
| ã | 13 | 3.8% |
| è | 10 | 2.9% |
| ú | 9 | 2.6% |
| ö | 8 | 2.3% |
| Other values (11) | 24 | 7.0% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 12 | |
| “ | 12 | |
| › | 3 | 10.7% |
| – | 1 | 3.6% |
Missing 
| Distinct | 196 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 583323 |
| Missing (%) | 99.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 9 |
| Mean length | 13.72813239 |
| Min length | 3 |
Unique
| Unique | 108 ? |
|---|---|
| Unique (%) | 8.5% |
Sample
| 1st row | altitude uncertain: label says both 5500 ft and 7000 ft |
|---|---|
| 2nd row | ca. 1050 m |
| 3rd row | ca. 4000 ft |
| 4th row | sea level |
| 5th row | 6230 ft |
| Value | Count | Frequency (%) |
| sea | 769 | |
| level | 769 | |
| ft | 409 | |
| ca | 177 | 4.8% |
| m | 115 | 3.1% |
| says | 114 | 3.1% |
| label | 100 | 2.7% |
| altitude | 92 | 2.5% |
| uncertain | 74 | 2.0% |
| of | 67 | 1.8% |
| Other values (170) | 986 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2820 | |
| 2403 | ||
| l | 1955 | |
| a | 1546 | |
| 0 | 1357 | 7.8% |
| s | 1076 | 6.2% |
| t | 881 | 5.1% |
| v | 812 | 4.7% |
| f | 520 | 3.0% |
| n | 353 | 2.0% |
| Other values (45) | 3698 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12005 | |
| Decimal Number | 2407 | 13.8% |
| Space Separator | 2403 | 13.8% |
| Other Punctuation | 415 | 2.4% |
| Math Symbol | 88 | 0.5% |
| Dash Punctuation | 72 | 0.4% |
| Uppercase Letter | 27 | 0.2% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2820 | |
| l | 1955 | |
| a | 1546 | |
| s | 1076 | 9.0% |
| t | 881 | 7.3% |
| v | 812 | 6.8% |
| f | 520 | 4.3% |
| n | 353 | 2.9% |
| c | 298 | 2.5% |
| i | 282 | 2.3% |
| Other values (14) | 1462 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1357 | |
| 1 | 245 | 10.2% |
| 5 | 213 | 8.8% |
| 6 | 133 | 5.5% |
| 2 | 119 | 4.9% |
| 3 | 102 | 4.2% |
| 8 | 92 | 3.8% |
| 9 | 57 | 2.4% |
| 4 | 54 | 2.2% |
| 7 | 35 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 9 | |
| L | 8 | |
| E | 4 | |
| A | 2 | 7.4% |
| O | 1 | 3.7% |
| C | 1 | 3.7% |
| I | 1 | 3.7% |
| B | 1 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 236 | |
| : | 99 | |
| , | 55 | 13.3% |
| ? | 17 | 4.1% |
| " | 4 | 1.0% |
| ; | 4 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 34 | |
| > | 33 | |
| + | 21 |
Space Separator
| Value | Count | Frequency (%) |
| 2403 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 72 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12032 | |
| Common | 5389 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2820 | |
| l | 1955 | |
| a | 1546 | |
| s | 1076 | 8.9% |
| t | 881 | 7.3% |
| v | 812 | 6.7% |
| f | 520 | 4.3% |
| n | 353 | 2.9% |
| c | 298 | 2.5% |
| i | 282 | 2.3% |
| Other values (22) | 1489 |
Common
| Value | Count | Frequency (%) |
| 2403 | ||
| 0 | 1357 | |
| 1 | 245 | 4.5% |
| . | 236 | 4.4% |
| 5 | 213 | 4.0% |
| 6 | 133 | 2.5% |
| 2 | 119 | 2.2% |
| 3 | 102 | 1.9% |
| : | 99 | 1.8% |
| 8 | 92 | 1.7% |
| Other values (13) | 390 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17421 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2820 | |
| 2403 | ||
| l | 1955 | |
| a | 1546 | |
| 0 | 1357 | 7.8% |
| s | 1076 | 6.2% |
| t | 881 | 5.1% |
| v | 812 | 4.7% |
| f | 520 | 3.0% |
| n | 353 | 2.0% |
| Other values (45) | 3698 |
decimalLatitude
Text
Missing 
| Distinct | 3290 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 556566 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 5.238100335 |
| Min length | 3 |
Unique
| Unique | 1421 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | 38.4236 |
|---|---|
| 2nd row | 5.85 |
| 3rd row | 7.97 |
| 4th row | 10.52 |
| 5th row | 0.35 |
| Value | Count | Frequency (%) |
| 34.9606 | 991 | 3.5% |
| 31.5011 | 663 | 2.4% |
| 9.03 | 592 | 2.1% |
| 8.25 | 507 | 1.8% |
| 6.45 | 506 | 1.8% |
| 29.3467 | 473 | 1.7% |
| 3.65 | 448 | 1.6% |
| 6.17 | 374 | 1.3% |
| 12.63 | 310 | 1.1% |
| 68.13 | 307 | 1.1% |
| Other values (3004) | 22855 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 28026 | |
| 3 | 14891 | |
| 1 | 14405 | |
| 5 | 12119 | |
| 6 | 11694 | |
| 8 | 11032 | 7.5% |
| 4 | 10580 | 7.2% |
| 7 | 10374 | 7.1% |
| 2 | 9852 | 6.7% |
| 0 | 9609 | 6.5% |
| Other values (2) | 14221 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 112692 | |
| Other Punctuation | 28026 | 19.1% |
| Dash Punctuation | 6085 | 4.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 14891 | |
| 1 | 14405 | |
| 5 | 12119 | |
| 6 | 11694 | |
| 8 | 11032 | |
| 4 | 10580 | |
| 7 | 10374 | |
| 2 | 9852 | |
| 0 | 9609 | |
| 9 | 8136 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28026 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6085 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 146803 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 28026 | |
| 3 | 14891 | |
| 1 | 14405 | |
| 5 | 12119 | |
| 6 | 11694 | |
| 8 | 11032 | 7.5% |
| 4 | 10580 | 7.2% |
| 7 | 10374 | 7.1% |
| 2 | 9852 | 6.7% |
| 0 | 9609 | 6.5% |
| Other values (2) | 14221 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 146803 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 28026 | |
| 3 | 14891 | |
| 1 | 14405 | |
| 5 | 12119 | |
| 6 | 11694 | |
| 8 | 11032 | 7.5% |
| 4 | 10580 | 7.2% |
| 7 | 10374 | 7.1% |
| 2 | 9852 | 6.7% |
| 0 | 9609 | 6.5% |
| Other values (2) | 14221 |
decimalLongitude
Text
Missing 
| Distinct | 3651 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 556566 |
| Missing (%) | 95.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.171804753 |
| Min length | 3 |
Unique
| Unique | 1668 ? |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | -79.2803 |
|---|---|
| 2nd row | 116.08 |
| 3rd row | -73.95 |
| 4th row | -75.02 |
| 5th row | -176.53 |
| Value | Count | Frequency (%) |
| 69.2778 | 991 | 3.5% |
| 65.8453 | 663 | 2.4% |
| 36.15 | 546 | 1.9% |
| 38.18 | 502 | 1.8% |
| 47.5206 | 473 | 1.7% |
| 34.58 | 464 | 1.7% |
| 52.37 | 452 | 1.6% |
| 37.5 | 368 | 1.3% |
| 165.95 | 307 | 1.1% |
| 74.08 | 295 | 1.1% |
| Other values (3513) | 22965 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 28026 | |
| 7 | 19303 | |
| 1 | 16358 | |
| - | 15622 | |
| 3 | 14216 | |
| 5 | 13864 | |
| 2 | 12920 | |
| 6 | 12744 | |
| 8 | 12375 | |
| 9 | 10013 | 5.8% |
| Other values (2) | 17530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 129323 | |
| Other Punctuation | 28026 | 16.2% |
| Dash Punctuation | 15622 | 9.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 19303 | |
| 1 | 16358 | |
| 3 | 14216 | |
| 5 | 13864 | |
| 2 | 12920 | |
| 6 | 12744 | |
| 8 | 12375 | |
| 9 | 10013 | |
| 0 | 8870 | |
| 4 | 8660 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28026 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 172971 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 28026 | |
| 7 | 19303 | |
| 1 | 16358 | |
| - | 15622 | |
| 3 | 14216 | |
| 5 | 13864 | |
| 2 | 12920 | |
| 6 | 12744 | |
| 8 | 12375 | |
| 9 | 10013 | 5.8% |
| Other values (2) | 17530 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172971 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 28026 | |
| 7 | 19303 | |
| 1 | 16358 | |
| - | 15622 | |
| 3 | 14216 | |
| 5 | 13864 | |
| 2 | 12920 | |
| 6 | 12744 | |
| 8 | 12375 | |
| 9 | 10013 | 5.8% |
| Other values (2) | 17530 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 567281 |
| Missing (%) | 97.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.88076945 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 17208 | |
| minutes | 17206 | |
| seconds | 17206 | |
| utm | 100 | 0.2% |
| unknown | 3 | < 0.1% |
| decimal | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 86038 | |
| s | 51620 | |
| n | 34421 | 8.7% |
| 34414 | 8.7% | |
| M | 17306 | 4.4% |
| o | 17209 | 4.3% |
| D | 17208 | 4.3% |
| c | 17208 | 4.3% |
| g | 17208 | 4.3% |
| r | 17208 | 4.3% |
| Other values (12) | 86249 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 309752 | |
| Uppercase Letter | 51923 | 13.1% |
| Space Separator | 34414 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 86038 | |
| s | 51620 | |
| n | 34421 | |
| o | 17209 | 5.6% |
| c | 17208 | 5.6% |
| g | 17208 | 5.6% |
| r | 17208 | 5.6% |
| i | 17208 | 5.6% |
| d | 17208 | 5.6% |
| t | 17206 | 5.6% |
| Other values (6) | 17218 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 17306 | |
| D | 17208 | |
| S | 17206 | |
| U | 103 | 0.2% |
| T | 100 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 34414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 361675 | |
| Common | 34414 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 86038 | |
| s | 51620 | |
| n | 34421 | |
| M | 17306 | 4.8% |
| o | 17209 | 4.8% |
| D | 17208 | 4.8% |
| c | 17208 | 4.8% |
| g | 17208 | 4.8% |
| r | 17208 | 4.8% |
| i | 17208 | 4.8% |
| Other values (11) | 69041 |
Common
| Value | Count | Frequency (%) |
| 34414 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 396089 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 86038 | |
| s | 51620 | |
| n | 34421 | 8.7% |
| 34414 | 8.7% | |
| M | 17306 | 4.4% |
| o | 17209 | 4.3% |
| D | 17208 | 4.3% |
| c | 17208 | 4.3% |
| g | 17208 | 4.3% |
| r | 17208 | 4.3% |
| Other values (12) | 86249 |
Missing 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 583342 |
| Missing (%) | 99.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 7.1184 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | GEOLocate tool |
|---|---|
| 2nd row | GPS |
| 3rd row | Google Earth maps |
| 4th row | GPS |
| 5th row | GPS |
| Value | Count | Frequency (%) |
| gps | 739 | |
| earth | 195 | 10.4% |
| maps | 195 | 10.4% |
| 195 | 10.4% | |
| geolocate | 179 | 9.6% |
| tool | 179 | 9.6% |
| map | 109 | 5.8% |
| online | 18 | 1.0% |
| recorded | 15 | 0.8% |
| not | 15 | 0.8% |
| Other values (7) | 35 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 1114 | |
| o | 988 | |
| P | 739 | 8.3% |
| S | 739 | 8.3% |
| a | 700 | 7.9% |
| 624 | 7.0% | |
| t | 582 | 6.5% |
| e | 436 | 4.9% |
| l | 413 | 4.6% |
| E | 374 | 4.2% |
| Other values (23) | 2189 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4838 | |
| Uppercase Letter | 3436 | |
| Space Separator | 624 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 988 | |
| a | 700 | |
| t | 582 | |
| e | 436 | |
| l | 413 | |
| p | 306 | 6.3% |
| m | 244 | 5.0% |
| r | 237 | 4.9% |
| c | 205 | 4.2% |
| g | 195 | 4.0% |
| Other values (12) | 532 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1114 | |
| P | 739 | |
| S | 739 | |
| E | 374 | 10.9% |
| O | 179 | 5.2% |
| L | 179 | 5.2% |
| M | 81 | 2.4% |
| U | 11 | 0.3% |
| C | 10 | 0.3% |
| T | 10 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 624 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8274 | |
| Common | 624 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1114 | |
| o | 988 | |
| P | 739 | 8.9% |
| S | 739 | 8.9% |
| a | 700 | 8.5% |
| t | 582 | 7.0% |
| e | 436 | 5.3% |
| l | 413 | 5.0% |
| E | 374 | 4.5% |
| p | 306 | 3.7% |
| Other values (22) | 1883 |
Common
| Value | Count | Frequency (%) |
| 624 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8898 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 1114 | |
| o | 988 | |
| P | 739 | 8.3% |
| S | 739 | 8.3% |
| a | 700 | 7.9% |
| 624 | 7.0% | |
| t | 582 | 6.5% |
| e | 436 | 4.9% |
| l | 413 | 4.6% |
| E | 374 | 4.2% |
| Other values (23) | 2189 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 583894 |
| Missing (%) | 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 8.736389685 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | uncertain |
|---|---|
| 2nd row | uncertain |
| 3rd row | uncertain |
| 4th row | uncertain |
| 5th row | uncertain |
| Value | Count | Frequency (%) |
| uncertain | 663 | |
| cf | 29 | 4.1% |
| sp | 4 | 0.6% |
| aff | 4 | 0.6% |
| near | 2 | 0.3% |
| vel | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1328 | |
| c | 692 | |
| a | 669 | |
| e | 666 | |
| r | 665 | |
| u | 663 | |
| t | 663 | |
| i | 663 | |
| f | 37 | 0.6% |
| . | 37 | 0.6% |
| Other values (5) | 15 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6056 | |
| Other Punctuation | 37 | 0.6% |
| Space Separator | 5 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1328 | |
| c | 692 | |
| a | 669 | |
| e | 666 | |
| r | 665 | |
| u | 663 | |
| t | 663 | |
| i | 663 | |
| f | 37 | 0.6% |
| s | 4 | 0.1% |
| Other values (3) | 6 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 37 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6056 | |
| Common | 42 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1328 | |
| c | 692 | |
| a | 669 | |
| e | 666 | |
| r | 665 | |
| u | 663 | |
| t | 663 | |
| i | 663 | |
| f | 37 | 0.6% |
| s | 4 | 0.1% |
| Other values (3) | 6 | 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 37 | |
| 5 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1328 | |
| c | 692 | |
| a | 669 | |
| e | 666 | |
| r | 665 | |
| u | 663 | |
| t | 663 | |
| i | 663 | |
| f | 37 | 0.6% |
| . | 37 | 0.6% |
| Other values (5) | 15 | 0.2% |
typeStatus
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 580632 |
| Missing (%) | 99.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 4.607323232 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | COTYPE |
|---|---|
| 2nd row | TYPE |
| 3rd row | TYPE |
| 4th row | TYPE |
| 5th row | TYPE |
| Value | Count | Frequency (%) |
| type | 2759 | |
| cotype | 1200 | |
| lectotype | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 3961 | |
| E | 3961 | |
| Y | 3960 | |
| P | 3960 | |
| C | 1201 | 6.6% |
| O | 1201 | 6.6% |
| L | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 18245 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3961 | |
| E | 3961 | |
| Y | 3960 | |
| P | 3960 | |
| C | 1201 | 6.6% |
| O | 1201 | 6.6% |
| L | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18245 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 3961 | |
| E | 3961 | |
| Y | 3960 | |
| P | 3960 | |
| C | 1201 | 6.6% |
| O | 1201 | 6.6% |
| L | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18245 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 3961 | |
| E | 3961 | |
| Y | 3960 | |
| P | 3960 | |
| C | 1201 | 6.6% |
| O | 1201 | 6.6% |
| L | 1 | < 0.1% |
identifiedBy
Text
Missing 
| Distinct | 69 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 581206 |
| Missing (%) | 99.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 129 |
|---|---|
| Median length | 18 |
| Mean length | 24.97489663 |
| Min length | 9 |
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Wetmore, Alexander |
|---|---|
| 2nd row | Maley, James M, Collections Manager, Occidental College - Moore Laboratory of Zoology (UNITED STATES) |
| 3rd row | Wetmore, Alexander |
| 4th row | Verhelst, Juan C |
| 5th row | Clark, W. S. |
| Value | Count | Frequency (%) |
| wetmore | 2393 | |
| alexander | 2382 | |
| of | 294 | 2.7% |
| 268 | 2.5% | |
| united | 266 | 2.4% |
| states | 265 | 2.4% |
| museum | 246 | 2.3% |
| history | 200 | 1.8% |
| natural | 200 | 1.8% |
| birds | 198 | 1.8% |
| Other values (178) | 4219 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11582 | |
| 7545 | 8.9% | |
| r | 6594 | 7.8% |
| a | 5098 | 6.0% |
| o | 5033 | 6.0% |
| t | 4517 | 5.3% |
| n | 4224 | 5.0% |
| l | 4082 | 4.8% |
| , | 3962 | 4.7% |
| m | 3194 | 3.8% |
| Other values (50) | 28734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57333 | |
| Uppercase Letter | 13878 | 16.4% |
| Space Separator | 7545 | 8.9% |
| Other Punctuation | 4577 | 5.4% |
| Close Punctuation | 477 | 0.6% |
| Open Punctuation | 477 | 0.6% |
| Dash Punctuation | 270 | 0.3% |
| Decimal Number | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11582 | |
| r | 6594 | |
| a | 5098 | |
| o | 5033 | |
| t | 4517 | 7.9% |
| n | 4224 | 7.4% |
| l | 4082 | 7.1% |
| m | 3194 | 5.6% |
| d | 2638 | 4.6% |
| x | 2382 | 4.2% |
| Other values (16) | 7989 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2751 | |
| W | 2686 | |
| S | 1048 | 7.6% |
| I | 824 | 5.9% |
| T | 808 | 5.8% |
| M | 697 | 5.0% |
| N | 690 | 5.0% |
| C | 642 | 4.6% |
| D | 607 | 4.4% |
| E | 558 | 4.0% |
| Other values (14) | 2567 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 5 | 2 | |
| 0 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3962 | |
| . | 615 | 13.4% |
Space Separator
| Value | Count | Frequency (%) |
| 7545 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 477 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 477 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 71211 | |
| Common | 13354 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11582 | |
| r | 6594 | 9.3% |
| a | 5098 | 7.2% |
| o | 5033 | 7.1% |
| t | 4517 | 6.3% |
| n | 4224 | 5.9% |
| l | 4082 | 5.7% |
| m | 3194 | 4.5% |
| A | 2751 | 3.9% |
| W | 2686 | 3.8% |
| Other values (40) | 21450 |
Common
| Value | Count | Frequency (%) |
| 7545 | ||
| , | 3962 | |
| . | 615 | 4.6% |
| ) | 477 | 3.6% |
| ( | 477 | 3.6% |
| - | 270 | 2.0% |
| 1 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 84564 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11582 | |
| 7545 | 8.9% | |
| r | 6594 | 7.8% |
| a | 5098 | 6.0% |
| o | 5033 | 6.0% |
| t | 4517 | 5.3% |
| n | 4224 | 5.0% |
| l | 4082 | 4.8% |
| , | 3962 | 4.7% |
| m | 3194 | 3.8% |
| Other values (49) | 28733 |
None
| Value | Count | Frequency (%) |
| à | 1 |
| Distinct | 18485 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.009553672 |
| Min length | 3 |
Unique
| Unique | 2480 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 2492087 |
|---|---|
| 2nd row | 2480415 |
| 3rd row | 2481705 |
| 4th row | 9367409 |
| 5th row | 5229959 |
| Value | Count | Frequency (%) |
| 9409198 | 2991 | 0.5% |
| 7192429 | 1918 | 0.3% |
| 7191991 | 1808 | 0.3% |
| 9685907 | 1565 | 0.3% |
| 9791464 | 1425 | 0.2% |
| 7341805 | 1363 | 0.2% |
| 2489985 | 1286 | 0.2% |
| 5231142 | 1245 | 0.2% |
| 2473421 | 1244 | 0.2% |
| 2489670 | 1187 | 0.2% |
| Other values (18475) | 568560 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4097729 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4097729 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4097729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
scientificName
Text
| Distinct | 18875 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 68 |
| Mean length | 36.39443065 |
| Min length | 4 |
Unique
| Unique | 2553 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Paroaria capitata (d'Orbigny & Lafresnaye, 1837) |
|---|---|
| 2nd row | Rostrhamus sociabilis (Vieillot, 1817) |
| 3rd row | Bartramia longicauda (Bechstein, 1812) |
| 4th row | Sterna hirundo Linnaeus, 1758 |
| 5th row | Prionochilus plateni W.Blasius, 1888 |
| Value | Count | Frequency (%) |
| linnaeus | 95179 | 3.9% |
| 1758 | 62131 | 2.5% |
| 1766 | 31804 | 1.3% |
| 1789 | 23736 | 1.0% |
| 21524 | 0.9% | |
| vieillot | 20514 | 0.8% |
| j.f.gmelin | 17875 | 0.7% |
| ridgway | 14989 | 0.6% |
| dendroica | 14825 | 0.6% |
| gmelin | 12921 | 0.5% |
| Other values (11256) | 2141359 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1872265 | 8.8% | |
| a | 1760824 | 8.3% |
| i | 1561286 | 7.3% |
| s | 1382114 | 6.5% |
| e | 1247644 | 5.9% |
| n | 1111280 | 5.2% |
| r | 1081068 | 5.1% |
| u | 984426 | 4.6% |
| l | 968653 | 4.6% |
| o | 962179 | 4.5% |
| Other values (68) | 8344154 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15008719 | |
| Decimal Number | 1926600 | 9.1% |
| Space Separator | 1872265 | 8.8% |
| Uppercase Letter | 1240000 | 5.8% |
| Other Punctuation | 643869 | 3.0% |
| Open Punctuation | 290919 | 1.4% |
| Close Punctuation | 290919 | 1.4% |
| Dash Punctuation | 2602 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1760824 | |
| i | 1561286 | |
| s | 1382114 | |
| e | 1247644 | 8.3% |
| n | 1111280 | 7.4% |
| r | 1081068 | 7.2% |
| u | 984426 | 6.6% |
| l | 968653 | 6.5% |
| o | 962179 | 6.4% |
| t | 711082 | 4.7% |
| Other values (23) | 3238163 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 162966 | |
| P | 118987 | 9.6% |
| S | 116593 | 9.4% |
| C | 116417 | 9.4% |
| G | 77102 | 6.2% |
| A | 76812 | 6.2% |
| B | 72288 | 5.8% |
| T | 64274 | 5.2% |
| M | 63743 | 5.1% |
| R | 48447 | 3.9% |
| Other values (17) | 322371 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 566928 | |
| 8 | 422023 | |
| 7 | 236330 | |
| 9 | 153919 | 8.0% |
| 6 | 130112 | 6.8% |
| 5 | 120708 | 6.3% |
| 3 | 84592 | 4.4% |
| 2 | 76423 | 4.0% |
| 0 | 68473 | 3.6% |
| 4 | 67092 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 481794 | |
| . | 139512 | 21.7% |
| & | 21524 | 3.3% |
| ' | 1039 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1872265 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 290919 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 290919 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2602 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16248719 | |
| Common | 5027174 | 23.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1760824 | 10.8% |
| i | 1561286 | 9.6% |
| s | 1382114 | 8.5% |
| e | 1247644 | 7.7% |
| n | 1111280 | 6.8% |
| r | 1081068 | 6.7% |
| u | 984426 | 6.1% |
| l | 968653 | 6.0% |
| o | 962179 | 5.9% |
| t | 711082 | 4.4% |
| Other values (50) | 4478163 |
Common
| Value | Count | Frequency (%) |
| 1872265 | ||
| 1 | 566928 | 11.3% |
| , | 481794 | 9.6% |
| 8 | 422023 | 8.4% |
| ( | 290919 | 5.8% |
| ) | 290919 | 5.8% |
| 7 | 236330 | 4.7% |
| 9 | 153919 | 3.1% |
| . | 139512 | 2.8% |
| 6 | 130112 | 2.6% |
| Other values (8) | 442453 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21269847 | |
| None | 6046 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1872265 | 8.8% | |
| a | 1760824 | 8.3% |
| i | 1561286 | 7.3% |
| s | 1382114 | 6.5% |
| e | 1247644 | 5.9% |
| n | 1111280 | 5.2% |
| r | 1081068 | 5.1% |
| u | 984426 | 4.6% |
| l | 968653 | 4.6% |
| o | 962179 | 4.5% |
| Other values (60) | 8338108 |
None
| Value | Count | Frequency (%) |
| ü | 4335 | |
| é | 883 | 14.6% |
| á | 360 | 6.0% |
| è | 250 | 4.1% |
| ö | 103 | 1.7% |
| ä | 90 | 1.5% |
| É | 17 | 0.3% |
| ø | 8 | 0.1% |
| Distinct | 185 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 89 |
|---|---|
| Median length | 78 |
| Mean length | 65.97973972 |
| Min length | 45 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Aves, Passeriformes, Emberizidae, Emberizinae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Aves, Falconiformes, Accipitridae |
| 3rd row | Animalia, Chordata, Vertebrata, Aves, Charadriiformes, Scolopacidae |
| 4th row | Animalia, Chordata, Vertebrata, Aves, Charadriiformes, Laridae |
| 5th row | Animalia, Chordata, Vertebrata, Aves, Passeriformes, Dicaeidae |
| Value | Count | Frequency (%) |
| animalia | 584592 | |
| aves | 584592 | |
| chordata | 584592 | |
| vertebrata | 584592 | |
| passeriformes | 372479 | |
| emberizidae | 72754 | 2.0% |
| emberizinae | 50573 | 1.4% |
| charadriiformes | 44080 | 1.2% |
| parulidae | 36362 | 1.0% |
| tyrannidae | 27497 | 0.8% |
| Other values (206) | 702489 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5107701 | |
| e | 3704731 | 9.6% |
| r | 3367631 | 8.7% |
| 3060010 | 7.9% | |
| , | 3060009 | 7.9% |
| i | 3035379 | 7.9% |
| s | 1981990 | 5.1% |
| t | 1944511 | 5.0% |
| o | 1467327 | 3.8% |
| m | 1357393 | 3.5% |
| Other values (37) | 10484546 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28806607 | |
| Uppercase Letter | 3644601 | 9.4% |
| Space Separator | 3060010 | 7.9% |
| Other Punctuation | 3060010 | 7.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5107701 | |
| e | 3704731 | |
| r | 3367631 | |
| i | 3035379 | |
| s | 1981990 | 6.9% |
| t | 1944511 | 6.8% |
| o | 1467327 | 5.1% |
| m | 1357393 | 4.7% |
| d | 1347875 | 4.7% |
| n | 998616 | 3.5% |
| Other values (13) | 4493453 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1269397 | |
| C | 741968 | |
| V | 592119 | |
| P | 536883 | |
| E | 129534 | 3.6% |
| T | 114606 | 3.1% |
| S | 71084 | 2.0% |
| F | 49758 | 1.4% |
| M | 26183 | 0.7% |
| G | 25043 | 0.7% |
| Other values (11) | 88026 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3060009 | |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3060010 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32451208 | |
| Common | 6120020 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5107701 | |
| e | 3704731 | |
| r | 3367631 | |
| i | 3035379 | 9.4% |
| s | 1981990 | 6.1% |
| t | 1944511 | 6.0% |
| o | 1467327 | 4.5% |
| m | 1357393 | 4.2% |
| d | 1347875 | 4.2% |
| A | 1269397 | 3.9% |
| Other values (34) | 7867273 |
Common
| Value | Count | Frequency (%) |
| 3060010 | ||
| , | 3060009 | |
| ? | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38571228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5107701 | |
| e | 3704731 | 9.6% |
| r | 3367631 | 8.7% |
| 3060010 | 7.9% | |
| , | 3060009 | 7.9% |
| i | 3035379 | 7.9% |
| s | 1981990 | 5.1% |
| t | 1944511 | 5.0% |
| o | 1467327 | 3.8% |
| m | 1357393 | 3.5% |
| Other values (37) | 10484546 |
kingdom
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1169184 | |
| a | 1169184 | |
| A | 584592 | |
| n | 584592 | |
| m | 584592 | |
| l | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4092144 | |
| Uppercase Letter | 584592 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1169184 | |
| a | 1169184 | |
| n | 584592 | |
| m | 584592 | |
| l | 584592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4676736 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1169184 | |
| a | 1169184 | |
| A | 584592 | |
| n | 584592 | |
| m | 584592 | |
| l | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4676736 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1169184 | |
| a | 1169184 | |
| A | 584592 | |
| n | 584592 | |
| m | 584592 | |
| l | 584592 |
phylum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 584587 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1169174 | |
| C | 584587 | |
| h | 584587 | |
| o | 584587 | |
| r | 584587 | |
| d | 584587 | |
| t | 584587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4092109 | |
| Uppercase Letter | 584587 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1169174 | |
| h | 584587 | |
| o | 584587 | |
| r | 584587 | |
| d | 584587 | |
| t | 584587 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 584587 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4676696 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1169174 | |
| C | 584587 | |
| h | 584587 | |
| o | 584587 | |
| r | 584587 | |
| d | 584587 | |
| t | 584587 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4676696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1169174 | |
| C | 584587 | |
| h | 584587 | |
| o | 584587 | |
| r | 584587 | |
| d | 584587 | |
| t | 584587 |
class
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aves |
|---|---|
| 2nd row | Aves |
| 3rd row | Aves |
| 4th row | Aves |
| 5th row | Aves |
| Value | Count | Frequency (%) |
| aves | 584587 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 584587 | |
| v | 584587 | |
| e | 584587 | |
| s | 584587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1753761 | |
| Uppercase Letter | 584587 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| v | 584587 | |
| e | 584587 | |
| s | 584587 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 584587 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2338348 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 584587 | |
| v | 584587 | |
| e | 584587 | |
| s | 584587 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2338348 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 584587 | |
| v | 584587 | |
| e | 584587 | |
| s | 584587 |
order
Text
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 13 |
| Mean length | 12.96889006 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Passeriformes |
|---|---|
| 2nd row | Accipitriformes |
| 3rd row | Charadriiformes |
| 4th row | Charadriiformes |
| 5th row | Passeriformes |
| Value | Count | Frequency (%) |
| passeriformes | 372474 | |
| charadriiformes | 44387 | 7.6% |
| piciformes | 22599 | 3.9% |
| apodiformes | 18185 | 3.1% |
| anseriformes | 15668 | 2.7% |
| galliformes | 14813 | 2.5% |
| columbiformes | 12800 | 2.2% |
| accipitriformes | 11414 | 2.0% |
| coraciiformes | 7822 | 1.3% |
| psittaciformes | 7419 | 1.3% |
| Other values (32) | 56991 | 9.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1353697 | |
| r | 1116871 | |
| e | 999308 | |
| i | 716230 | |
| o | 644987 | |
| m | 602388 | |
| f | 584572 | |
| a | 516843 | 6.8% |
| P | 419526 | 5.5% |
| c | 90235 | 1.2% |
| Other values (24) | 536593 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6996678 | |
| Uppercase Letter | 584572 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1353697 | |
| r | 1116871 | |
| e | 999308 | |
| i | 716230 | |
| o | 644987 | |
| m | 602388 | |
| f | 584572 | |
| a | 516843 | 7.4% |
| c | 90235 | 1.3% |
| l | 82324 | 1.2% |
| Other values (10) | 289223 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 419526 | |
| C | 75814 | 13.0% |
| A | 45305 | 7.8% |
| G | 22108 | 3.8% |
| S | 11813 | 2.0% |
| F | 4459 | 0.8% |
| T | 3057 | 0.5% |
| B | 1625 | 0.3% |
| M | 348 | 0.1% |
| O | 237 | < 0.1% |
| Other values (4) | 280 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7581250 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1353697 | |
| r | 1116871 | |
| e | 999308 | |
| i | 716230 | |
| o | 644987 | |
| m | 602388 | |
| f | 584572 | |
| a | 516843 | 6.8% |
| P | 419526 | 5.5% |
| c | 90235 | 1.2% |
| Other values (24) | 536593 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7581250 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1353697 | |
| r | 1116871 | |
| e | 999308 | |
| i | 716230 | |
| o | 644987 | |
| m | 602388 | |
| f | 584572 | |
| a | 516843 | 6.8% |
| P | 419526 | 5.5% |
| c | 90235 | 1.2% |
| Other values (24) | 536593 | 7.1% |
family
Text
| Distinct | 239 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 10.42056366 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Thraupidae |
|---|---|
| 2nd row | Accipitridae |
| 3rd row | Scolopacidae |
| 4th row | Laridae |
| 5th row | Dicaeidae |
| Value | Count | Frequency (%) |
| passerellidae | 39435 | 6.7% |
| parulidae | 34481 | 5.9% |
| tyrannidae | 26165 | 4.5% |
| icteridae | 19964 | 3.4% |
| thraupidae | 18114 | 3.1% |
| picidae | 17391 | 3.0% |
| fringillidae | 17014 | 2.9% |
| scolopacidae | 16651 | 2.8% |
| turdidae | 16039 | 2.7% |
| anatidae | 15579 | 2.7% |
| Other values (229) | 363742 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 939318 | |
| i | 882003 | |
| e | 764759 | |
| d | 672252 | |
| r | 396820 | 6.5% |
| l | 338348 | 5.6% |
| c | 231479 | 3.8% |
| o | 225694 | 3.7% |
| n | 207894 | 3.4% |
| t | 157040 | 2.6% |
| Other values (32) | 1275994 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5507026 | |
| Uppercase Letter | 584575 | 9.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 939318 | |
| i | 882003 | |
| e | 764759 | |
| d | 672252 | |
| r | 396820 | |
| l | 338348 | 6.1% |
| c | 231479 | 4.2% |
| o | 225694 | 4.1% |
| n | 207894 | 3.8% |
| t | 157040 | 2.9% |
| Other values (11) | 691419 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 154750 | |
| T | 101314 | |
| C | 66304 | |
| A | 52928 | 9.1% |
| S | 37881 | 6.5% |
| M | 33527 | 5.7% |
| F | 30447 | 5.2% |
| L | 22426 | 3.8% |
| I | 20551 | 3.5% |
| R | 11420 | 2.0% |
| Other values (11) | 53027 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6091601 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 939318 | |
| i | 882003 | |
| e | 764759 | |
| d | 672252 | |
| r | 396820 | 6.5% |
| l | 338348 | 5.6% |
| c | 231479 | 3.8% |
| o | 225694 | 3.7% |
| n | 207894 | 3.4% |
| t | 157040 | 2.6% |
| Other values (32) | 1275994 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6091601 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 939318 | |
| i | 882003 | |
| e | 764759 | |
| d | 672252 | |
| r | 396820 | 6.5% |
| l | 338348 | 5.6% |
| c | 231479 | 3.8% |
| o | 225694 | 3.7% |
| n | 207894 | 3.4% |
| t | 157040 | 2.6% |
| Other values (32) | 1275994 |
genus
Text
| Distinct | 2196 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 338 |
| Missing (%) | 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 8.640262626 |
| Min length | 3 |
Unique
| Unique | 84 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paroaria |
|---|---|
| 2nd row | Rostrhamus |
| 3rd row | Bartramia |
| 4th row | Sterna |
| 5th row | Prionochilus |
| Value | Count | Frequency (%) |
| setophaga | 18301 | 3.1% |
| melospiza | 7103 | 1.2% |
| turdus | 6838 | 1.2% |
| calidris | 6684 | 1.1% |
| vireo | 6403 | 1.1% |
| agelaius | 5379 | 0.9% |
| catharus | 4885 | 0.8% |
| junco | 4780 | 0.8% |
| geothlypis | 4423 | 0.8% |
| zonotrichia | 4075 | 0.7% |
| Other values (2186) | 515383 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 525335 | 10.4% |
| o | 417081 | 8.3% |
| i | 391104 | 7.7% |
| s | 389715 | 7.7% |
| r | 334572 | 6.6% |
| e | 326852 | 6.5% |
| u | 295820 | 5.9% |
| l | 272317 | 5.4% |
| t | 236060 | 4.7% |
| n | 223789 | 4.4% |
| Other values (42) | 1635463 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4463854 | |
| Uppercase Letter | 584254 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 525335 | |
| o | 417081 | 9.3% |
| i | 391104 | 8.8% |
| s | 389715 | 8.7% |
| r | 334572 | 7.5% |
| e | 326852 | 7.3% |
| u | 295820 | 6.6% |
| l | 272317 | 6.1% |
| t | 236060 | 5.3% |
| n | 223789 | 5.0% |
| Other values (16) | 1051209 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 89334 | |
| P | 84753 | |
| S | 65896 | |
| A | 55035 | |
| M | 47839 | |
| T | 41913 | 7.2% |
| L | 30152 | 5.2% |
| E | 23489 | 4.0% |
| G | 17415 | 3.0% |
| H | 17156 | 2.9% |
| Other values (16) | 111272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5048108 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 525335 | 10.4% |
| o | 417081 | 8.3% |
| i | 391104 | 7.7% |
| s | 389715 | 7.7% |
| r | 334572 | 6.6% |
| e | 326852 | 6.5% |
| u | 295820 | 5.9% |
| l | 272317 | 5.4% |
| t | 236060 | 4.7% |
| n | 223789 | 4.4% |
| Other values (42) | 1635463 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5048108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 525335 | 10.4% |
| o | 417081 | 8.3% |
| i | 391104 | 7.7% |
| s | 389715 | 7.7% |
| r | 334572 | 6.6% |
| e | 326852 | 6.5% |
| u | 295820 | 5.9% |
| l | 272317 | 5.4% |
| t | 236060 | 4.7% |
| n | 223789 | 4.4% |
| Other values (42) | 1635463 |
genericName
Text
| Distinct | 2024 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 495 |
| Missing (%) | 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 8.461623669 |
| Min length | 3 |
Unique
| Unique | 81 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paroaria |
|---|---|
| 2nd row | Rostrhamus |
| 3rd row | Bartramia |
| 4th row | Sterna |
| 5th row | Prionochilus |
| Value | Count | Frequency (%) |
| dendroica | 14825 | 2.5% |
| parus | 7485 | 1.3% |
| melospiza | 7103 | 1.2% |
| turdus | 6813 | 1.2% |
| vireo | 6403 | 1.1% |
| calidris | 6372 | 1.1% |
| sterna | 6184 | 1.1% |
| agelaius | 5525 | 0.9% |
| carduelis | 5507 | 0.9% |
| picoides | 5086 | 0.9% |
| Other values (2014) | 512794 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 519400 | 10.5% |
| i | 397930 | 8.1% |
| o | 386413 | 7.8% |
| s | 382857 | 7.7% |
| r | 365575 | 7.4% |
| u | 308704 | 6.2% |
| e | 306794 | 6.2% |
| l | 267331 | 5.4% |
| n | 223958 | 4.5% |
| c | 212647 | 4.3% |
| Other values (42) | 1570800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4358312 | |
| Uppercase Letter | 584097 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 519400 | |
| i | 397930 | |
| o | 386413 | 8.9% |
| s | 382857 | 8.8% |
| r | 365575 | 8.4% |
| u | 308704 | 7.1% |
| e | 306794 | 7.0% |
| l | 267331 | 6.1% |
| n | 223958 | 5.1% |
| c | 212647 | 4.9% |
| Other values (16) | 986703 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 92459 | |
| P | 87930 | |
| A | 56987 | |
| S | 48616 | |
| M | 44863 | 7.7% |
| T | 42410 | 7.3% |
| D | 28038 | 4.8% |
| L | 25718 | 4.4% |
| E | 22717 | 3.9% |
| G | 16758 | 2.9% |
| Other values (16) | 117601 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4942409 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 519400 | 10.5% |
| i | 397930 | 8.1% |
| o | 386413 | 7.8% |
| s | 382857 | 7.7% |
| r | 365575 | 7.4% |
| u | 308704 | 6.2% |
| e | 306794 | 6.2% |
| l | 267331 | 5.4% |
| n | 223958 | 4.5% |
| c | 212647 | 4.3% |
| Other values (42) | 1570800 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4942409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 519400 | 10.5% |
| i | 397930 | 8.1% |
| o | 386413 | 7.8% |
| s | 382857 | 7.7% |
| r | 365575 | 7.4% |
| u | 308704 | 6.2% |
| e | 306794 | 6.2% |
| l | 267331 | 5.4% |
| n | 223958 | 4.5% |
| c | 212647 | 4.3% |
| Other values (42) | 1570800 |
specificEpithet
Text
Missing 
| Distinct | 4643 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 7917 |
| Missing (%) | 1.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 8.786944119 |
| Min length | 3 |
Unique
| Unique | 322 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | capitata |
|---|---|
| 2nd row | sociabilis |
| 3rd row | longicauda |
| 4th row | hirundo |
| 5th row | plateni |
| Value | Count | Frequency (%) |
| melodia | 5111 | 0.9% |
| phoeniceus | 4986 | 0.9% |
| hyemalis | 4880 | 0.8% |
| americana | 4671 | 0.8% |
| canadensis | 3833 | 0.7% |
| sandwichensis | 3774 | 0.7% |
| pusilla | 3572 | 0.6% |
| alpestris | 3345 | 0.6% |
| olivaceus | 3301 | 0.6% |
| carolinensis | 3295 | 0.6% |
| Other values (4633) | 535907 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 623579 | |
| i | 556345 | |
| s | 506337 | |
| u | 361837 | 7.1% |
| r | 360464 | 7.1% |
| e | 352637 | 7.0% |
| l | 331573 | 6.5% |
| n | 305381 | 6.0% |
| c | 304652 | 6.0% |
| o | 272602 | 5.4% |
| Other values (16) | 1091804 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5067211 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 623579 | |
| i | 556345 | |
| s | 506337 | |
| u | 361837 | 7.1% |
| r | 360464 | 7.1% |
| e | 352637 | 7.0% |
| l | 331573 | 6.5% |
| n | 305381 | 6.0% |
| c | 304652 | 6.0% |
| o | 272602 | 5.4% |
| Other values (16) | 1091804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5067211 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 623579 | |
| i | 556345 | |
| s | 506337 | |
| u | 361837 | 7.1% |
| r | 360464 | 7.1% |
| e | 352637 | 7.0% |
| l | 331573 | 6.5% |
| n | 305381 | 6.0% |
| c | 304652 | 6.0% |
| o | 272602 | 5.4% |
| Other values (16) | 1091804 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5067211 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 623579 | |
| i | 556345 | |
| s | 506337 | |
| u | 361837 | 7.1% |
| r | 360464 | 7.1% |
| e | 352637 | 7.0% |
| l | 331573 | 6.5% |
| n | 305381 | 6.0% |
| c | 304652 | 6.0% |
| o | 272602 | 5.4% |
| Other values (16) | 1091804 |
Missing 
| Distinct | 6225 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 308675 |
| Missing (%) | 52.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 8.918026073 |
| Min length | 2 |
Unique
| Unique | 702 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | solitarius |
|---|---|
| 2nd row | flavoolivaceus |
| 3rd row | satrapa |
| 4th row | australis |
| 5th row | malherbii |
| Value | Count | Frequency (%) |
| carolinensis | 1803 | 0.7% |
| olivaceus | 1259 | 0.5% |
| pinus | 1235 | 0.4% |
| occidentalis | 1175 | 0.4% |
| coronata | 1165 | 0.4% |
| pusilla | 1144 | 0.4% |
| flammea | 1046 | 0.4% |
| arizonae | 1029 | 0.4% |
| hyemalis | 1005 | 0.4% |
| frontalis | 1004 | 0.4% |
| Other values (6215) | 264052 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 290816 | |
| a | 278989 | |
| s | 254771 | |
| e | 190758 | 7.8% |
| r | 173209 | 7.0% |
| n | 170468 | 6.9% |
| u | 157384 | 6.4% |
| l | 149284 | 6.1% |
| o | 136638 | 5.6% |
| c | 130973 | 5.3% |
| Other values (17) | 527345 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2460622 | |
| Dash Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 290816 | |
| a | 278989 | |
| s | 254771 | |
| e | 190758 | 7.8% |
| r | 173209 | 7.0% |
| n | 170468 | 6.9% |
| u | 157384 | 6.4% |
| l | 149284 | 6.1% |
| o | 136638 | 5.6% |
| c | 130973 | 5.3% |
| Other values (16) | 527332 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2460622 | |
| Common | 13 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 290816 | |
| a | 278989 | |
| s | 254771 | |
| e | 190758 | 7.8% |
| r | 173209 | 7.0% |
| n | 170468 | 6.9% |
| u | 157384 | 6.4% |
| l | 149284 | 6.1% |
| o | 136638 | 5.6% |
| c | 130973 | 5.3% |
| Other values (16) | 527332 |
Common
| Value | Count | Frequency (%) |
| - | 13 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2460635 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 290816 | |
| a | 278989 | |
| s | 254771 | |
| e | 190758 | 7.8% |
| r | 173209 | 7.0% |
| n | 170468 | 6.9% |
| u | 157384 | 6.4% |
| l | 149284 | 6.1% |
| o | 136638 | 5.6% |
| c | 130973 | 5.3% |
| Other values (17) | 527345 |
taxonRank
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 8.389954019 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 300916 | |
| subspecies | 275917 | |
| genus | 7422 | 1.3% |
| family | 324 | 0.1% |
| class | 12 | < 0.1% |
| form | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1437029 | |
| E | 1161088 | |
| I | 577157 | |
| C | 576845 | |
| P | 576833 | |
| U | 283339 | 5.8% |
| B | 275917 | 5.6% |
| G | 7422 | 0.2% |
| N | 7422 | 0.2% |
| A | 336 | < 0.1% |
| Other values (6) | 1312 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4904700 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1437029 | |
| E | 1161088 | |
| I | 577157 | |
| C | 576845 | |
| P | 576833 | |
| U | 283339 | 5.8% |
| B | 275917 | 5.6% |
| G | 7422 | 0.2% |
| N | 7422 | 0.2% |
| A | 336 | < 0.1% |
| Other values (6) | 1312 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4904700 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1437029 | |
| E | 1161088 | |
| I | 577157 | |
| C | 576845 | |
| P | 576833 | |
| U | 283339 | 5.8% |
| B | 275917 | 5.6% |
| G | 7422 | 0.2% |
| N | 7422 | 0.2% |
| A | 336 | < 0.1% |
| Other values (6) | 1312 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4904700 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1437029 | |
| E | 1161088 | |
| I | 577157 | |
| C | 576845 | |
| P | 576833 | |
| U | 283339 | 5.8% |
| B | 275917 | 5.6% |
| G | 7422 | 0.2% |
| N | 7422 | 0.2% |
| A | 336 | < 0.1% |
| Other values (6) | 1312 | < 0.1% |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.793247256 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 463081 | |
| synonym | 120866 | 20.7% |
| doubtful | 645 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 926162 | |
| E | 926162 | |
| T | 463726 | |
| D | 463726 | |
| A | 463081 | |
| P | 463081 | |
| Y | 241732 | 5.3% |
| N | 241732 | 5.3% |
| O | 121511 | 2.7% |
| S | 120866 | 2.7% |
| Other values (5) | 124091 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4555870 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 926162 | |
| E | 926162 | |
| T | 463726 | |
| D | 463726 | |
| A | 463081 | |
| P | 463081 | |
| Y | 241732 | 5.3% |
| N | 241732 | 5.3% |
| O | 121511 | 2.7% |
| S | 120866 | 2.7% |
| Other values (5) | 124091 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4555870 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 926162 | |
| E | 926162 | |
| T | 463726 | |
| D | 463726 | |
| A | 463081 | |
| P | 463081 | |
| Y | 241732 | 5.3% |
| N | 241732 | 5.3% |
| O | 121511 | 2.7% |
| S | 120866 | 2.7% |
| Other values (5) | 124091 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4555870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 926162 | |
| E | 926162 | |
| T | 463726 | |
| D | 463726 | |
| A | 463081 | |
| P | 463081 | |
| Y | 241732 | 5.3% |
| N | 241732 | 5.3% |
| O | 121511 | 2.7% |
| S | 120866 | 2.7% |
| Other values (5) | 124091 | 2.7% |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2338368 | |
| a | 2338368 | |
| - | 2338368 | |
| 2 | 1753776 | |
| b | 1753776 | |
| 4 | 1753776 | |
| 8 | 1169184 | 5.6% |
| 3 | 1169184 | 5.6% |
| 5 | 1169184 | 5.6% |
| 9 | 1169184 | 5.6% |
| Other values (6) | 4092144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10522656 | |
| Lowercase Letter | 8184288 | |
| Dash Punctuation | 2338368 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1753776 | |
| 4 | 1753776 | |
| 8 | 1169184 | |
| 3 | 1169184 | |
| 5 | 1169184 | |
| 9 | 1169184 | |
| 1 | 584592 | 5.6% |
| 7 | 584592 | 5.6% |
| 0 | 584592 | 5.6% |
| 6 | 584592 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2338368 | |
| a | 2338368 | |
| b | 1753776 | |
| d | 1169184 | |
| e | 584592 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2338368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12861024 | |
| Latin | 8184288 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2338368 | |
| 2 | 1753776 | |
| 4 | 1753776 | |
| 8 | 1169184 | |
| 3 | 1169184 | |
| 5 | 1169184 | |
| 9 | 1169184 | |
| 1 | 584592 | 4.5% |
| 7 | 584592 | 4.5% |
| 0 | 584592 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 2338368 | |
| a | 2338368 | |
| b | 1753776 | |
| d | 1169184 | |
| e | 584592 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21045312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2338368 | |
| a | 2338368 | |
| - | 2338368 | |
| 2 | 1753776 | |
| b | 1753776 | |
| 4 | 1753776 | |
| 8 | 1169184 | 5.6% |
| 3 | 1169184 | 5.6% |
| 5 | 1169184 | 5.6% |
| 9 | 1169184 | 5.6% |
| Other values (6) | 4092144 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1169184 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1169184 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1169184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 584592 | |
| S | 584592 |
lastInterpreted
Text
| Distinct | 183965 |
|---|---|
| Distinct (%) | 31.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99608616 |
| Min length | 20 |
Unique
| Unique | 40346 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2024-12-02T13:56:05.137Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.067Z |
| 3rd row | 2024-12-02T13:59:48.585Z |
| 4th row | 2024-12-02T13:56:09.311Z |
| 5th row | 2024-12-02T13:58:24.805Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:59.341z | 17 | < 0.1% |
| 2024-12-02t13:57:45.007z | 16 | < 0.1% |
| 2024-12-02t13:57:38.028z | 16 | < 0.1% |
| 2024-12-02t13:57:53.841z | 16 | < 0.1% |
| 2024-12-02t13:57:44.964z | 15 | < 0.1% |
| 2024-12-02t13:58:02.321z | 15 | < 0.1% |
| 2024-12-02t13:57:53.332z | 15 | < 0.1% |
| 2024-12-02t13:57:51.208z | 15 | < 0.1% |
| 2024-12-02t13:58:02.659z | 15 | < 0.1% |
| 2024-12-02t13:57:41.116z | 15 | < 0.1% |
| Other values (183955) | 584437 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 6.7% |
| 5 | 927783 | 6.6% |
| 3 | 925430 | 6.6% |
| T | 584592 | 4.2% |
| Z | 584592 | 4.2% |
| Other values (5) | 2098897 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9936348 | |
| Other Punctuation | 1753204 | 12.5% |
| Dash Punctuation | 1169184 | 8.3% |
| Uppercase Letter | 1169184 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| 4 | 940152 | 9.5% |
| 5 | 927783 | 9.3% |
| 3 | 925430 | 9.3% |
| 7 | 449478 | 4.5% |
| 9 | 373966 | 3.8% |
| 6 | 351326 | 3.5% |
| 8 | 340107 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1169184 | |
| . | 584020 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12858736 | |
| Latin | 1169184 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 7.3% |
| 5 | 927783 | 7.2% |
| 3 | 925430 | 7.2% |
| . | 584020 | 4.5% |
| 7 | 449478 | 3.5% |
| Other values (3) | 1065399 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14027920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 6.7% |
| 5 | 927783 | 6.6% |
| 3 | 925430 | 6.6% |
| T | 584592 | 4.2% |
| Z | 584592 | 4.2% |
| Other values (5) | 2098897 |
elevation
Text
Missing 
| Distinct | 1379 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 498000 |
| Missing (%) | 85.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.453991131 |
| Min length | 3 |
Unique
| Unique | 391 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 1040.0 |
|---|---|
| 2nd row | 655.0 |
| 3rd row | 1524.0 |
| 4th row | 30.0 |
| 5th row | 220.0 |
| Value | Count | Frequency (%) |
| 1829.0 | 2382 | 2.8% |
| 914.0 | 2016 | 2.3% |
| 1219.0 | 1941 | 2.2% |
| 610.0 | 1879 | 2.2% |
| 1524.0 | 1853 | 2.1% |
| 1676.0 | 1775 | 2.0% |
| 2134.0 | 1668 | 1.9% |
| 305.0 | 1650 | 1.9% |
| 1067.0 | 1237 | 1.4% |
| 1372.0 | 1235 | 1.4% |
| Other values (1368) | 68956 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 121330 | |
| . | 86592 | |
| 1 | 60345 | |
| 2 | 39201 | 8.3% |
| 5 | 31691 | 6.7% |
| 3 | 26681 | 5.6% |
| 6 | 22707 | 4.8% |
| 4 | 22205 | 4.7% |
| 7 | 21516 | 4.6% |
| 8 | 20688 | 4.4% |
| Other values (2) | 19316 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 385678 | |
| Other Punctuation | 86592 | 18.3% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 121330 | |
| 1 | 60345 | |
| 2 | 39201 | 10.2% |
| 5 | 31691 | 8.2% |
| 3 | 26681 | 6.9% |
| 6 | 22707 | 5.9% |
| 4 | 22205 | 5.8% |
| 7 | 21516 | 5.6% |
| 8 | 20688 | 5.4% |
| 9 | 19314 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 86592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 472272 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 121330 | |
| . | 86592 | |
| 1 | 60345 | |
| 2 | 39201 | 8.3% |
| 5 | 31691 | 6.7% |
| 3 | 26681 | 5.6% |
| 6 | 22707 | 4.8% |
| 4 | 22205 | 4.7% |
| 7 | 21516 | 4.6% |
| 8 | 20688 | 4.4% |
| Other values (2) | 19316 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 472272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 121330 | |
| . | 86592 | |
| 1 | 60345 | |
| 2 | 39201 | 8.3% |
| 5 | 31691 | 6.7% |
| 3 | 26681 | 5.6% |
| 6 | 22707 | 4.8% |
| 4 | 22205 | 4.7% |
| 7 | 21516 | 4.6% |
| 8 | 20688 | 4.4% |
| Other values (2) | 19316 | 4.1% |
Missing 
| Distinct | 89 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 574752 |
| Missing (%) | 98.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.386788618 |
| Min length | 3 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 38.0 |
|---|---|
| 2nd row | 76.0 |
| 3rd row | 76.5 |
| 4th row | 106.5 |
| 5th row | 152.0 |
| Value | Count | Frequency (%) |
| 152.5 | 2223 | |
| 76.0 | 1274 | |
| 76.5 | 1047 | 10.6% |
| 30.5 | 536 | 5.4% |
| 45.5 | 426 | 4.3% |
| 61.0 | 404 | 4.1% |
| 0.0 | 394 | 4.0% |
| 106.5 | 310 | 3.2% |
| 91.5 | 290 | 2.9% |
| 46.0 | 265 | 2.7% |
| Other values (79) | 2671 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 9840 | |
| 5 | 9285 | |
| 0 | 6214 | |
| 1 | 4673 | |
| 2 | 3784 | 8.8% |
| 6 | 3342 | 7.7% |
| 7 | 2713 | 6.3% |
| 3 | 1154 | 2.7% |
| 4 | 917 | 2.1% |
| 8 | 711 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33326 | |
| Other Punctuation | 9840 | 22.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 9285 | |
| 0 | 6214 | |
| 1 | 4673 | |
| 2 | 3784 | |
| 6 | 3342 | 10.0% |
| 7 | 2713 | 8.1% |
| 3 | 1154 | 3.5% |
| 4 | 917 | 2.8% |
| 8 | 711 | 2.1% |
| 9 | 533 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9840 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 43166 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 9840 | |
| 5 | 9285 | |
| 0 | 6214 | |
| 1 | 4673 | |
| 2 | 3784 | 8.8% |
| 6 | 3342 | 7.7% |
| 7 | 2713 | 6.3% |
| 3 | 1154 | 2.7% |
| 4 | 917 | 2.1% |
| 8 | 711 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43166 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 9840 | |
| 5 | 9285 | |
| 0 | 6214 | |
| 1 | 4673 | |
| 2 | 3784 | 8.8% |
| 6 | 3342 | 7.7% |
| 7 | 2713 | 6.3% |
| 3 | 1154 | 2.7% |
| 4 | 917 | 2.1% |
| 8 | 711 | 1.6% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 62.5% |
| Missing | 584584 |
| Missing (%) | > 99.9% |
| Memory size | 4.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.125 |
| Min length | 16 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 37.5% |
Sample
| 1st row | 368.745418614193 |
|---|---|
| 2nd row | 918.1358064728217 |
| 3rd row | 4411.160071289899 |
| 4th row | 4391.045588808231 |
| 5th row | 4411.160071289899 |
| Value | Count | Frequency (%) |
| 4411.160071289899 | 3 | |
| 2413.9981382897595 | 2 | |
| 368.745418614193 | 1 | 12.5% |
| 918.1358064728217 | 1 | 12.5% |
| 4391.045588808231 | 1 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 8 | 21 | |
| 9 | 20 | |
| 4 | 14 | |
| 2 | 10 | |
| 0 | 9 | 6.6% |
| 3 | 9 | 6.6% |
| . | 8 | 5.8% |
| 7 | 8 | 5.8% |
| 5 | 8 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 129 | |
| Other Punctuation | 8 | 5.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 8 | 21 | |
| 9 | 20 | |
| 4 | 14 | |
| 2 | 10 | |
| 0 | 9 | 7.0% |
| 3 | 9 | 7.0% |
| 7 | 8 | 6.2% |
| 5 | 8 | 6.2% |
| 6 | 6 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 137 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 8 | 21 | |
| 9 | 20 | |
| 4 | 14 | |
| 2 | 10 | |
| 0 | 9 | 6.6% |
| 3 | 9 | 6.6% |
| . | 8 | 5.8% |
| 7 | 8 | 5.8% |
| 5 | 8 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 137 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 8 | 21 | |
| 9 | 20 | |
| 4 | 14 | |
| 2 | 10 | |
| 0 | 9 | 6.6% |
| 3 | 9 | 6.6% |
| . | 8 | 5.8% |
| 7 | 8 | 5.8% |
| 5 | 8 | 5.8% |
issue
Text
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 186 |
|---|---|
| Median length | 48 |
| Mean length | 53.02144744 |
| Min length | 48 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 488030 | |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 40818 | 7.0% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 19491 | 3.3% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 14228 | 2.4% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 10628 | 1.8% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;taxon_match_higherrank | 2362 | 0.4% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid | 2250 | 0.4% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid | 1482 | 0.3% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid;taxon_match_higherrank | 777 | 0.1% |
| occurrence_status_inferred_from_individual_count;continent_country_mismatch | 752 | 0.1% |
| Other values (64) | 3774 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 3192048 | |
| R | 3060927 | |
| N | 2568512 | 8.3% |
| E | 2532321 | 8.2% |
| I | 2495611 | 8.1% |
| C | 2478423 | 8.0% |
| U | 2425637 | 7.8% |
| T | 2012103 | 6.5% |
| O | 1910196 | 6.2% |
| D | 1890173 | 6.1% |
| Other values (17) | 6429963 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 27626359 | |
| Connector Punctuation | 3192048 | 10.3% |
| Other Punctuation | 121455 | 0.4% |
| Decimal Number | 56052 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3060927 | |
| N | 2568512 | |
| E | 2532321 | |
| I | 2495611 | |
| C | 2478423 | |
| U | 2425637 | |
| T | 2012103 | |
| O | 1910196 | 6.9% |
| D | 1890173 | 6.8% |
| A | 1412809 | 5.1% |
| Other values (13) | 4839647 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 28026 | |
| 4 | 28026 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3192048 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 121455 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27626359 | |
| Common | 3369555 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 3060927 | |
| N | 2568512 | |
| E | 2532321 | |
| I | 2495611 | |
| C | 2478423 | |
| U | 2425637 | |
| T | 2012103 | |
| O | 1910196 | 6.9% |
| D | 1890173 | 6.8% |
| A | 1412809 | 5.1% |
| Other values (13) | 4839647 |
Common
| Value | Count | Frequency (%) |
| _ | 3192048 | |
| ; | 121455 | 3.6% |
| 8 | 28026 | 0.8% |
| 4 | 28026 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30995914 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 3192048 | |
| R | 3060927 | |
| N | 2568512 | 8.3% |
| E | 2532321 | 8.2% |
| I | 2495611 | 8.1% |
| C | 2478423 | 8.0% |
| U | 2425637 | 7.8% |
| T | 2012103 | 6.5% |
| O | 1910196 | 6.2% |
| D | 1890173 | 6.1% |
| Other values (17) | 6429963 |
mediaType
Text
Missing 
| Distinct | 65 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 26095 |
| Missing (%) | 4.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 1165 |
|---|---|
| Median length | 10 |
| Mean length | 10.99926231 |
| Min length | 10 |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 544064 | |
| stillimage;stillimage | 6302 | 1.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 4028 | 0.7% |
| stillimage;stillimage;stillimage;stillimage | 1341 | 0.2% |
| stillimage;stillimage;stillimage | 1085 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 446 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 299 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 160 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 119 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 99 | < 0.1% |
| Other values (55) | 554 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1218464 | |
| S | 609232 | |
| t | 609232 | |
| i | 609232 | |
| I | 609232 | |
| m | 609232 | |
| a | 609232 | |
| g | 609232 | |
| e | 609232 | |
| ; | 50735 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4873856 | |
| Uppercase Letter | 1218464 | 19.8% |
| Other Punctuation | 50735 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1218464 | |
| t | 609232 | |
| i | 609232 | |
| m | 609232 | |
| a | 609232 | |
| g | 609232 | |
| e | 609232 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 609232 | |
| I | 609232 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 50735 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6092320 | |
| Common | 50735 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1218464 | |
| S | 609232 | |
| t | 609232 | |
| i | 609232 | |
| I | 609232 | |
| m | 609232 | |
| a | 609232 | |
| g | 609232 | |
| e | 609232 |
Common
| Value | Count | Frequency (%) |
| ; | 50735 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6143055 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1218464 | |
| S | 609232 | |
| t | 609232 | |
| i | 609232 | |
| I | 609232 | |
| m | 609232 | |
| a | 609232 | |
| g | 609232 | |
| e | 609232 | |
| ; | 50735 | 0.8% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.952058872 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 556566 | |
| true | 28026 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 556566 | |
| a | 556566 | |
| l | 556566 | |
| s | 556566 | |
| t | 28026 | 1.0% |
| r | 28026 | 1.0% |
| u | 28026 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2894934 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 556566 | |
| a | 556566 | |
| l | 556566 | |
| s | 556566 | |
| t | 28026 | 1.0% |
| r | 28026 | 1.0% |
| u | 28026 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2894934 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 556566 | |
| a | 556566 | |
| l | 556566 | |
| s | 556566 | |
| t | 28026 | 1.0% |
| r | 28026 | 1.0% |
| u | 28026 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2894934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 556566 | |
| a | 556566 | |
| l | 556566 | |
| s | 556566 | |
| t | 28026 | 1.0% |
| r | 28026 | 1.0% |
| u | 28026 | 1.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.999095095 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 584063 | |
| true | 529 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 584063 | |
| a | 584063 | |
| l | 584063 | |
| s | 584063 | |
| t | 529 | < 0.1% |
| r | 529 | < 0.1% |
| u | 529 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2922431 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 584063 | |
| a | 584063 | |
| l | 584063 | |
| s | 584063 | |
| t | 529 | < 0.1% |
| r | 529 | < 0.1% |
| u | 529 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2922431 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 584063 | |
| a | 584063 | |
| l | 584063 | |
| s | 584063 | |
| t | 529 | < 0.1% |
| r | 529 | < 0.1% |
| u | 529 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2922431 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 584063 | |
| a | 584063 | |
| l | 584063 | |
| s | 584063 | |
| t | 529 | < 0.1% |
| r | 529 | < 0.1% |
| u | 529 | < 0.1% |
taxonKey
Text
| Distinct | 18875 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.002856693 |
| Min length | 3 |
Unique
| Unique | 2553 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 2492087 |
|---|---|
| 2nd row | 2480415 |
| 3rd row | 2481705 |
| 4th row | 9367409 |
| 5th row | 5229959 |
| Value | Count | Frequency (%) |
| 9409198 | 2991 | 0.5% |
| 5229252 | 1915 | 0.3% |
| 9685907 | 1565 | 0.3% |
| 9791464 | 1425 | 0.2% |
| 2489985 | 1281 | 0.2% |
| 5231142 | 1245 | 0.2% |
| 2473421 | 1244 | 0.2% |
| 2489670 | 1187 | 0.2% |
| 7191634 | 1155 | 0.2% |
| 2489730 | 1077 | 0.2% |
| Other values (18865) | 569507 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 589019 | |
| 4 | 482025 | |
| 1 | 468053 | |
| 7 | 445367 | |
| 9 | 442904 | |
| 8 | 390063 | |
| 6 | 370970 | |
| 5 | 314508 | |
| 0 | 304387 | |
| 3 | 286518 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4093814 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 589019 | |
| 4 | 482025 | |
| 1 | 468053 | |
| 7 | 445367 | |
| 9 | 442904 | |
| 8 | 390063 | |
| 6 | 370970 | |
| 5 | 314508 | |
| 0 | 304387 | |
| 3 | 286518 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4093814 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 589019 | |
| 4 | 482025 | |
| 1 | 468053 | |
| 7 | 445367 | |
| 9 | 442904 | |
| 8 | 390063 | |
| 6 | 370970 | |
| 5 | 314508 | |
| 0 | 304387 | |
| 3 | 286518 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4093814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 589019 | |
| 4 | 482025 | |
| 1 | 468053 | |
| 7 | 445367 | |
| 9 | 442904 | |
| 8 | 390063 | |
| 6 | 370970 | |
| 5 | 314508 | |
| 0 | 304387 | |
| 3 | 286518 |
acceptedTaxonKey
Text
| Distinct | 18485 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.009553672 |
| Min length | 3 |
Unique
| Unique | 2480 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 2492087 |
|---|---|
| 2nd row | 2480415 |
| 3rd row | 2481705 |
| 4th row | 9367409 |
| 5th row | 5229959 |
| Value | Count | Frequency (%) |
| 9409198 | 2991 | 0.5% |
| 7192429 | 1918 | 0.3% |
| 7191991 | 1808 | 0.3% |
| 9685907 | 1565 | 0.3% |
| 9791464 | 1425 | 0.2% |
| 7341805 | 1363 | 0.2% |
| 2489985 | 1286 | 0.2% |
| 5231142 | 1245 | 0.2% |
| 2473421 | 1244 | 0.2% |
| 2489670 | 1187 | 0.2% |
| Other values (18475) | 568560 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4097729 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4097729 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4097729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 550899 | |
| 4 | 487888 | |
| 1 | 454466 | |
| 9 | 450768 | |
| 7 | 438367 | |
| 8 | 395864 | |
| 6 | 370734 | |
| 0 | 331876 | |
| 5 | 312473 | |
| 3 | 304394 |
kingdomKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 584592 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 584592 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 584592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 584592 |
phylumKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 584587 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1169174 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1169174 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1169174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1169174 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 1169174 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1169174 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 1169174 |
classKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 212 |
|---|---|
| 2nd row | 212 |
| 3rd row | 212 |
| 4th row | 212 |
| 5th row | 212 |
| Value | Count | Frequency (%) |
| 212 | 584587 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1169174 | |
| 1 | 584587 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1753761 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1169174 | |
| 1 | 584587 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1753761 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1169174 | |
| 1 | 584587 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1753761 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1169174 | |
| 1 | 584587 |
orderKey
Text
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.739311838 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 729 |
|---|---|
| 2nd row | 7191147 |
| 3rd row | 7192402 |
| 4th row | 7192402 |
| 5th row | 729 |
| Value | Count | Frequency (%) |
| 729 | 372474 | |
| 7192402 | 44387 | 7.6% |
| 724 | 22599 | 3.9% |
| 1448 | 18185 | 3.1% |
| 1108 | 15668 | 2.7% |
| 723 | 14813 | 2.5% |
| 1446 | 12800 | 2.2% |
| 7191147 | 11414 | 2.0% |
| 1447 | 7822 | 1.3% |
| 1445 | 7419 | 1.3% |
| Other values (32) | 56991 | 9.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 536873 | |
| 2 | 518543 | |
| 9 | 478674 | |
| 1 | 217733 | |
| 4 | 205606 | 9.4% |
| 0 | 84766 | 3.9% |
| 5 | 50334 | 2.3% |
| 8 | 44059 | 2.0% |
| 3 | 29305 | 1.3% |
| 6 | 20004 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2185897 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 536873 | |
| 2 | 518543 | |
| 9 | 478674 | |
| 1 | 217733 | |
| 4 | 205606 | 9.4% |
| 0 | 84766 | 3.9% |
| 5 | 50334 | 2.3% |
| 8 | 44059 | 2.0% |
| 3 | 29305 | 1.3% |
| 6 | 20004 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2185897 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 536873 | |
| 2 | 518543 | |
| 9 | 478674 | |
| 1 | 217733 | |
| 4 | 205606 | 9.4% |
| 0 | 84766 | 3.9% |
| 5 | 50334 | 2.3% |
| 8 | 44059 | 2.0% |
| 3 | 29305 | 1.3% |
| 6 | 20004 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2185897 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 536873 | |
| 2 | 518543 | |
| 9 | 478674 | |
| 1 | 217733 | |
| 4 | 205606 | 9.4% |
| 0 | 84766 | 3.9% |
| 5 | 50334 | 2.3% |
| 8 | 44059 | 2.0% |
| 3 | 29305 | 1.3% |
| 6 | 20004 | 0.9% |
familyKey
Text
| Distinct | 239 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17 |
| Missing (%) | < 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.376314416 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9352 |
|---|---|
| 2nd row | 2877 |
| 3rd row | 5282 |
| 4th row | 9316 |
| 5th row | 4287160 |
| Value | Count | Frequency (%) |
| 9410667 | 39435 | 6.7% |
| 5263 | 34481 | 5.9% |
| 5291 | 26165 | 4.5% |
| 6176 | 19964 | 3.4% |
| 9352 | 18114 | 3.1% |
| 9333 | 17391 | 3.0% |
| 5242 | 17014 | 2.9% |
| 5282 | 16651 | 2.8% |
| 5290 | 16039 | 2.7% |
| 2986 | 15579 | 2.7% |
| Other values (229) | 363742 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 436950 | |
| 5 | 376699 | |
| 9 | 375236 | |
| 3 | 343158 | |
| 6 | 246705 | |
| 1 | 193745 | |
| 8 | 150651 | 5.9% |
| 0 | 147912 | 5.8% |
| 7 | 147562 | 5.8% |
| 4 | 139666 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2558284 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 436950 | |
| 5 | 376699 | |
| 9 | 375236 | |
| 3 | 343158 | |
| 6 | 246705 | |
| 1 | 193745 | |
| 8 | 150651 | 5.9% |
| 0 | 147912 | 5.8% |
| 7 | 147562 | 5.8% |
| 4 | 139666 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2558284 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 436950 | |
| 5 | 376699 | |
| 9 | 375236 | |
| 3 | 343158 | |
| 6 | 246705 | |
| 1 | 193745 | |
| 8 | 150651 | 5.9% |
| 0 | 147912 | 5.8% |
| 7 | 147562 | 5.8% |
| 4 | 139666 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2558284 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 436950 | |
| 5 | 376699 | |
| 9 | 375236 | |
| 3 | 343158 | |
| 6 | 246705 | |
| 1 | 193745 | |
| 8 | 150651 | 5.9% |
| 0 | 147912 | 5.8% |
| 7 | 147562 | 5.8% |
| 4 | 139666 | 5.5% |
genusKey
Text
| Distinct | 2196 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 338 |
| Missing (%) | 0.1% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.005993968 |
| Min length | 7 |
Unique
| Unique | 84 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2492080 |
|---|---|
| 2nd row | 2480414 |
| 3rd row | 2481704 |
| 4th row | 2481227 |
| 5th row | 2484660 |
| Value | Count | Frequency (%) |
| 2489984 | 18301 | 3.1% |
| 2492191 | 7103 | 1.2% |
| 2490714 | 6838 | 1.2% |
| 2481739 | 6684 | 1.1% |
| 2487406 | 6403 | 1.1% |
| 2484444 | 5379 | 0.9% |
| 2490799 | 4885 | 0.8% |
| 2492009 | 4780 | 0.8% |
| 2489637 | 4423 | 0.8% |
| 6173226 | 4075 | 0.7% |
| Other values (2186) | 515383 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 811117 | |
| 2 | 756156 | |
| 8 | 500110 | |
| 9 | 498768 | |
| 7 | 305303 | 7.5% |
| 1 | 276534 | 6.8% |
| 3 | 259496 | 6.3% |
| 6 | 243474 | 5.9% |
| 0 | 240481 | 5.9% |
| 5 | 201841 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4093280 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 811117 | |
| 2 | 756156 | |
| 8 | 500110 | |
| 9 | 498768 | |
| 7 | 305303 | 7.5% |
| 1 | 276534 | 6.8% |
| 3 | 259496 | 6.3% |
| 6 | 243474 | 5.9% |
| 0 | 240481 | 5.9% |
| 5 | 201841 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4093280 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 811117 | |
| 2 | 756156 | |
| 8 | 500110 | |
| 9 | 498768 | |
| 7 | 305303 | 7.5% |
| 1 | 276534 | 6.8% |
| 3 | 259496 | 6.3% |
| 6 | 243474 | 5.9% |
| 0 | 240481 | 5.9% |
| 5 | 201841 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4093280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 811117 | |
| 2 | 756156 | |
| 8 | 500110 | |
| 9 | 498768 | |
| 7 | 305303 | 7.5% |
| 1 | 276534 | 6.8% |
| 3 | 259496 | 6.3% |
| 6 | 243474 | 5.9% |
| 0 | 240481 | 5.9% |
| 5 | 201841 | 4.9% |
speciesKey
Text
Missing 
| Distinct | 8234 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 7853 |
| Missing (%) | 1.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.008723183 |
| Min length | 7 |
Unique
| Unique | 681 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2492087 |
|---|---|
| 2nd row | 2480415 |
| 3rd row | 2481705 |
| 4th row | 9367409 |
| 5th row | 5229959 |
| Value | Count | Frequency (%) |
| 2492196 | 5111 | 0.9% |
| 9409198 | 4956 | 0.9% |
| 9362842 | 4264 | 0.7% |
| 5231142 | 3641 | 0.6% |
| 9415596 | 3345 | 0.6% |
| 2489670 | 2973 | 0.5% |
| 9510564 | 2633 | 0.5% |
| 5789284 | 1921 | 0.3% |
| 5231132 | 1886 | 0.3% |
| 2478259 | 1885 | 0.3% |
| Other values (8224) | 544124 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 724124 | |
| 4 | 637906 | |
| 9 | 466985 | |
| 8 | 437618 | |
| 5 | 324481 | |
| 1 | 308952 | |
| 3 | 303698 | |
| 7 | 294100 | |
| 0 | 292502 | |
| 6 | 251838 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4042204 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 724124 | |
| 4 | 637906 | |
| 9 | 466985 | |
| 8 | 437618 | |
| 5 | 324481 | |
| 1 | 308952 | |
| 3 | 303698 | |
| 7 | 294100 | |
| 0 | 292502 | |
| 6 | 251838 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4042204 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 724124 | |
| 4 | 637906 | |
| 9 | 466985 | |
| 8 | 437618 | |
| 5 | 324481 | |
| 1 | 308952 | |
| 3 | 303698 | |
| 7 | 294100 | |
| 0 | 292502 | |
| 6 | 251838 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4042204 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 724124 | |
| 4 | 637906 | |
| 9 | 466985 | |
| 8 | 437618 | |
| 5 | 324481 | |
| 1 | 308952 | |
| 3 | 303698 | |
| 7 | 294100 | |
| 0 | 292502 | |
| 6 | 251838 | 6.2% |
species
Text
Missing 
| Distinct | 8234 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 7853 |
| Missing (%) | 1.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 29 |
| Mean length | 18.43415132 |
| Min length | 9 |
Unique
| Unique | 681 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Paroaria capitata |
|---|---|
| 2nd row | Rostrhamus sociabilis |
| 3rd row | Bartramia longicauda |
| 4th row | Sterna hirundo |
| 5th row | Prionochilus plateni |
| Value | Count | Frequency (%) |
| setophaga | 18263 | 1.6% |
| melospiza | 7103 | 0.6% |
| turdus | 6787 | 0.6% |
| calidris | 6682 | 0.6% |
| vireo | 6370 | 0.6% |
| agelaius | 5367 | 0.5% |
| melodia | 5111 | 0.4% |
| phoeniceus | 4986 | 0.4% |
| catharus | 4865 | 0.4% |
| hyemalis | 4856 | 0.4% |
| Other values (6762) | 1083406 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1140380 | 10.7% |
| i | 948191 | 8.9% |
| s | 897805 | 8.4% |
| r | 687139 | 6.5% |
| o | 682086 | 6.4% |
| e | 675341 | 6.4% |
| u | 655256 | 6.2% |
| l | 598920 | 5.6% |
| 577057 | 5.4% | |
| n | 529238 | 5.0% |
| Other values (43) | 3240281 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9477896 | |
| Space Separator | 577057 | 5.4% |
| Uppercase Letter | 576741 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1140380 | |
| i | 948191 | |
| s | 897805 | |
| r | 687139 | 7.2% |
| o | 682086 | 7.2% |
| e | 675341 | 7.1% |
| u | 655256 | 6.9% |
| l | 598920 | 6.3% |
| n | 529238 | 5.6% |
| c | 507575 | 5.4% |
| Other values (16) | 2155965 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 87799 | |
| P | 83433 | |
| S | 65465 | |
| A | 54586 | |
| M | 47035 | |
| T | 41684 | 7.2% |
| L | 29823 | 5.2% |
| E | 23359 | 4.1% |
| G | 17319 | 3.0% |
| H | 17005 | 2.9% |
| Other values (16) | 109233 |
Space Separator
| Value | Count | Frequency (%) |
| 577057 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10054637 | |
| Common | 577057 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1140380 | |
| i | 948191 | 9.4% |
| s | 897805 | 8.9% |
| r | 687139 | 6.8% |
| o | 682086 | 6.8% |
| e | 675341 | 6.7% |
| u | 655256 | 6.5% |
| l | 598920 | 6.0% |
| n | 529238 | 5.3% |
| c | 507575 | 5.0% |
| Other values (42) | 2732706 |
Common
| Value | Count | Frequency (%) |
| 577057 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10631694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1140380 | 10.7% |
| i | 948191 | 8.9% |
| s | 897805 | 8.4% |
| r | 687139 | 6.5% |
| o | 682086 | 6.4% |
| e | 675341 | 6.4% |
| u | 655256 | 6.2% |
| l | 598920 | 5.6% |
| 577057 | 5.4% | |
| n | 529238 | 5.0% |
| Other values (43) | 3240281 |
| Distinct | 18485 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 69 |
| Mean length | 36.3684775 |
| Min length | 4 |
Unique
| Unique | 2480 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Paroaria capitata (d'Orbigny & Lafresnaye, 1837) |
|---|---|
| 2nd row | Rostrhamus sociabilis (Vieillot, 1817) |
| 3rd row | Bartramia longicauda (Bechstein, 1812) |
| 4th row | Sterna hirundo Linnaeus, 1758 |
| 5th row | Prionochilus plateni W.Blasius, 1888 |
| Value | Count | Frequency (%) |
| linnaeus | 96059 | 3.9% |
| 1758 | 62975 | 2.6% |
| 1766 | 31923 | 1.3% |
| 1789 | 22527 | 0.9% |
| 21216 | 0.9% | |
| vieillot | 20464 | 0.8% |
| setophaga | 18301 | 0.8% |
| j.f.gmelin | 17514 | 0.7% |
| ridgway | 15118 | 0.6% |
| gmelin | 12289 | 0.5% |
| Other values (11309) | 2119945 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1853739 | 8.7% | |
| a | 1757649 | 8.3% |
| i | 1549698 | 7.3% |
| s | 1388187 | 6.5% |
| e | 1256072 | 5.9% |
| n | 1103662 | 5.2% |
| r | 1034765 | 4.9% |
| o | 978110 | 4.6% |
| u | 968585 | 4.6% |
| l | 964073 | 4.5% |
| Other values (68) | 8406181 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15002770 | |
| Decimal Number | 1908932 | 9.0% |
| Space Separator | 1853739 | 8.7% |
| Uppercase Letter | 1233477 | 5.8% |
| Other Punctuation | 637605 | 3.0% |
| Close Punctuation | 310825 | 1.5% |
| Open Punctuation | 310825 | 1.5% |
| Dash Punctuation | 2548 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1757649 | |
| i | 1549698 | |
| s | 1388187 | |
| e | 1256072 | 8.4% |
| n | 1103662 | 7.4% |
| r | 1034765 | 6.9% |
| o | 978110 | 6.5% |
| u | 968585 | 6.5% |
| l | 964073 | 6.4% |
| t | 729354 | 4.9% |
| Other values (23) | 3272615 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 167639 | |
| S | 133493 | |
| P | 115415 | 9.4% |
| C | 112848 | 9.1% |
| G | 76000 | 6.2% |
| A | 75543 | 6.1% |
| B | 72757 | 5.9% |
| M | 66309 | 5.4% |
| T | 62888 | 5.1% |
| R | 49146 | 4.0% |
| Other values (17) | 301439 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 562050 | |
| 8 | 422555 | |
| 7 | 235719 | |
| 9 | 147314 | 7.7% |
| 6 | 128558 | 6.7% |
| 5 | 121276 | 6.4% |
| 3 | 81907 | 4.3% |
| 2 | 75569 | 4.0% |
| 0 | 67710 | 3.5% |
| 4 | 66274 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 477346 | |
| . | 138018 | 21.6% |
| & | 21216 | 3.3% |
| ' | 1025 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1853739 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 310825 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 310825 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16236247 | |
| Common | 5024474 | 23.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1757649 | 10.8% |
| i | 1549698 | 9.5% |
| s | 1388187 | 8.5% |
| e | 1256072 | 7.7% |
| n | 1103662 | 6.8% |
| r | 1034765 | 6.4% |
| o | 978110 | 6.0% |
| u | 968585 | 6.0% |
| l | 964073 | 5.9% |
| t | 729354 | 4.5% |
| Other values (50) | 4506092 |
Common
| Value | Count | Frequency (%) |
| 1853739 | ||
| 1 | 562050 | 11.2% |
| , | 477346 | 9.5% |
| 8 | 422555 | 8.4% |
| ) | 310825 | 6.2% |
| ( | 310825 | 6.2% |
| 7 | 235719 | 4.7% |
| 9 | 147314 | 2.9% |
| . | 138018 | 2.7% |
| 6 | 128558 | 2.6% |
| Other values (8) | 437525 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21254630 | |
| None | 6091 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1853739 | 8.7% | |
| a | 1757649 | 8.3% |
| i | 1549698 | 7.3% |
| s | 1388187 | 6.5% |
| e | 1256072 | 5.9% |
| n | 1103662 | 5.2% |
| r | 1034765 | 4.9% |
| o | 978110 | 4.6% |
| u | 968585 | 4.6% |
| l | 964073 | 4.5% |
| Other values (60) | 8400090 |
None
| Value | Count | Frequency (%) |
| ü | 4413 | |
| é | 890 | 14.6% |
| á | 359 | 5.9% |
| è | 250 | 4.1% |
| ä | 90 | 1.5% |
| ö | 60 | 1.0% |
| É | 21 | 0.3% |
| ø | 8 | 0.1% |
| Distinct | 22061 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 65 |
|---|---|
| Median length | 50 |
| Mean length | 23.69967259 |
| Min length | 7 |
Unique
| Unique | 3436 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Paroaria capitata |
|---|---|
| 2nd row | Rostrhamus sociabilis |
| 3rd row | Bartramia longicauda |
| 4th row | Sterna hirundo |
| 5th row | Prionochilus plateni |
| Value | Count | Frequency (%) |
| dendroica | 14826 | 1.0% |
| parus | 7485 | 0.5% |
| melospiza | 7103 | 0.5% |
| turdus | 6813 | 0.5% |
| vireo | 6404 | 0.4% |
| calidris | 6376 | 0.4% |
| sterna | 6184 | 0.4% |
| hyemalis | 5963 | 0.4% |
| melodia | 5927 | 0.4% |
| carduelis | 5742 | 0.4% |
| Other values (10903) | 1419872 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1477651 | 10.7% |
| i | 1303096 | 9.4% |
| s | 1190344 | 8.6% |
| r | 934012 | 6.7% |
| 908103 | 6.6% | |
| e | 885911 | 6.4% |
| u | 853994 | 6.2% |
| o | 821323 | 5.9% |
| l | 776498 | 5.6% |
| n | 730705 | 5.3% |
| Other values (48) | 3973002 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12360285 | |
| Space Separator | 908103 | 6.6% |
| Uppercase Letter | 584699 | 4.2% |
| Other Punctuation | 1511 | < 0.1% |
| Dash Punctuation | 41 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1477651 | |
| i | 1303096 | |
| s | 1190344 | |
| r | 934012 | 7.6% |
| e | 885911 | 7.2% |
| u | 853994 | 6.9% |
| o | 821323 | 6.6% |
| l | 776498 | 6.3% |
| n | 730705 | 5.9% |
| c | 671384 | 5.4% |
| Other values (16) | 2715367 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 92584 | |
| P | 87973 | |
| A | 57125 | |
| S | 48743 | |
| M | 44873 | 7.7% |
| T | 42452 | 7.3% |
| D | 28042 | 4.8% |
| L | 25741 | 4.4% |
| E | 22719 | 3.9% |
| G | 16764 | 2.9% |
| Other values (16) | 117683 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1121 | |
| " | 348 | 23.0% |
| / | 37 | 2.4% |
| ? | 5 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 908103 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12944984 | |
| Common | 909655 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1477651 | |
| i | 1303096 | 10.1% |
| s | 1190344 | 9.2% |
| r | 934012 | 7.2% |
| e | 885911 | 6.8% |
| u | 853994 | 6.6% |
| o | 821323 | 6.3% |
| l | 776498 | 6.0% |
| n | 730705 | 5.6% |
| c | 671384 | 5.2% |
| Other values (42) | 3300066 |
Common
| Value | Count | Frequency (%) |
| 908103 | ||
| . | 1121 | 0.1% |
| " | 348 | < 0.1% |
| - | 41 | < 0.1% |
| / | 37 | < 0.1% |
| ? | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13854639 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1477651 | 10.7% |
| i | 1303096 | 9.4% |
| s | 1190344 | 8.6% |
| r | 934012 | 6.7% |
| 908103 | 6.6% | |
| e | 885911 | 6.4% |
| u | 853994 | 6.2% |
| o | 821323 | 5.9% |
| l | 776498 | 5.6% |
| n | 730705 | 5.3% |
| Other values (48) | 3973002 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 584592 | |
| M | 584592 | |
| L | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1753776 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 584592 | |
| M | 584592 | |
| L | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1753776 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 584592 | |
| M | 584592 | |
| L | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1753776 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 584592 | |
| M | 584592 | |
| L | 584592 |
lastParsed
Text
| Distinct | 183965 |
|---|---|
| Distinct (%) | 31.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99608616 |
| Min length | 20 |
Unique
| Unique | 40346 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2024-12-02T13:56:05.137Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.067Z |
| 3rd row | 2024-12-02T13:59:48.585Z |
| 4th row | 2024-12-02T13:56:09.311Z |
| 5th row | 2024-12-02T13:58:24.805Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:59.341z | 17 | < 0.1% |
| 2024-12-02t13:57:45.007z | 16 | < 0.1% |
| 2024-12-02t13:57:38.028z | 16 | < 0.1% |
| 2024-12-02t13:57:53.841z | 16 | < 0.1% |
| 2024-12-02t13:57:44.964z | 15 | < 0.1% |
| 2024-12-02t13:58:02.321z | 15 | < 0.1% |
| 2024-12-02t13:57:53.332z | 15 | < 0.1% |
| 2024-12-02t13:57:51.208z | 15 | < 0.1% |
| 2024-12-02t13:58:02.659z | 15 | < 0.1% |
| 2024-12-02t13:57:41.116z | 15 | < 0.1% |
| Other values (183955) | 584437 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 6.7% |
| 5 | 927783 | 6.6% |
| 3 | 925430 | 6.6% |
| T | 584592 | 4.2% |
| Z | 584592 | 4.2% |
| Other values (5) | 2098897 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9936348 | |
| Other Punctuation | 1753204 | 12.5% |
| Dash Punctuation | 1169184 | 8.3% |
| Uppercase Letter | 1169184 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| 4 | 940152 | 9.5% |
| 5 | 927783 | 9.3% |
| 3 | 925430 | 9.3% |
| 7 | 449478 | 4.5% |
| 9 | 373966 | 3.8% |
| 6 | 351326 | 3.5% |
| 8 | 340107 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1169184 | |
| . | 584020 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12858736 | |
| Latin | 1169184 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 7.3% |
| 5 | 927783 | 7.2% |
| 3 | 925430 | 7.2% |
| . | 584020 | 4.5% |
| 7 | 449478 | 3.5% |
| Other values (3) | 1065399 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14027920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2670609 | |
| 0 | 1482175 | |
| 1 | 1475322 | |
| - | 1169184 | |
| : | 1169184 | |
| 4 | 940152 | 6.7% |
| 5 | 927783 | 6.6% |
| 3 | 925430 | 6.6% |
| T | 584592 | 4.2% |
| Z | 584592 | 4.2% |
| Other values (5) | 2098897 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2922960 | |
| 1 | 2338368 | |
| 4 | 1753776 | |
| 0 | 1169184 | 8.3% |
| - | 1169184 | 8.3% |
| : | 1169184 | 8.3% |
| T | 584592 | 4.2% |
| 8 | 584592 | 4.2% |
| 3 | 584592 | 4.2% |
| . | 584592 | 4.2% |
| Other values (2) | 1169184 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9938064 | |
| Other Punctuation | 1753776 | 12.5% |
| Dash Punctuation | 1169184 | 8.3% |
| Uppercase Letter | 1169184 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2922960 | |
| 1 | 2338368 | |
| 4 | 1753776 | |
| 0 | 1169184 | 11.8% |
| 8 | 584592 | 5.9% |
| 3 | 584592 | 5.9% |
| 6 | 584592 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1169184 | |
| . | 584592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12861024 | |
| Latin | 1169184 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2922960 | |
| 1 | 2338368 | |
| 4 | 1753776 | |
| 0 | 1169184 | 9.1% |
| - | 1169184 | 9.1% |
| : | 1169184 | 9.1% |
| 8 | 584592 | 4.5% |
| 3 | 584592 | 4.5% |
| . | 584592 | 4.5% |
| 6 | 584592 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 584592 | |
| Z | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14030208 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2922960 | |
| 1 | 2338368 | |
| 4 | 1753776 | |
| 0 | 1169184 | 8.3% |
| - | 1169184 | 8.3% |
| : | 1169184 | 8.3% |
| T | 584592 | 4.2% |
| 8 | 584592 | 4.2% |
| 3 | 584592 | 4.2% |
| . | 584592 | 4.2% |
| Other values (2) | 1169184 | 8.3% |
repatriated
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3194 |
| Missing (%) | 0.5% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.372956219 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 364562 | |
| false | 216836 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 581398 | |
| t | 364562 | |
| r | 364562 | |
| u | 364562 | |
| f | 216836 | 8.5% |
| a | 216836 | 8.5% |
| l | 216836 | 8.5% |
| s | 216836 | 8.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2542428 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 581398 | |
| t | 364562 | |
| r | 364562 | |
| u | 364562 | |
| f | 216836 | 8.5% |
| a | 216836 | 8.5% |
| l | 216836 | 8.5% |
| s | 216836 | 8.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2542428 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 581398 | |
| t | 364562 | |
| r | 364562 | |
| u | 364562 | |
| f | 216836 | 8.5% |
| a | 216836 | 8.5% |
| l | 216836 | 8.5% |
| s | 216836 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2542428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 581398 | |
| t | 364562 | |
| r | 364562 | |
| u | 364562 | |
| f | 216836 | 8.5% |
| a | 216836 | 8.5% |
| l | 216836 | 8.5% |
| s | 216836 | 8.5% |
isSequenced
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.992324561 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 580105 | |
| true | 4487 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 580105 | |
| a | 580105 | |
| l | 580105 | |
| s | 580105 | |
| t | 4487 | 0.2% |
| r | 4487 | 0.2% |
| u | 4487 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2918473 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 580105 | |
| a | 580105 | |
| l | 580105 | |
| s | 580105 | |
| t | 4487 | 0.2% |
| r | 4487 | 0.2% |
| u | 4487 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2918473 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 580105 | |
| a | 580105 | |
| l | 580105 | |
| s | 580105 | |
| t | 4487 | 0.2% |
| r | 4487 | 0.2% |
| u | 4487 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2918473 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 584592 | |
| f | 580105 | |
| a | 580105 | |
| l | 580105 | |
| s | 580105 | |
| t | 4487 | 0.2% |
| r | 4487 | 0.2% |
| u | 4487 | 0.2% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19462 |
| Missing (%) | 3.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 10.62288677 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LATIN_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| north_america | 235097 | |
| latin_america | 161402 | |
| asia | 91675 | 16.2% |
| africa | 47164 | 8.3% |
| oceania | 14385 | 2.5% |
| europe | 13906 | 2.5% |
| antarctica | 1501 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1265351 | |
| I | 712626 | |
| R | 694167 | |
| C | 461050 | 7.7% |
| E | 438696 | 7.3% |
| N | 412385 | 6.9% |
| T | 399501 | 6.7% |
| _ | 396499 | 6.6% |
| M | 396499 | 6.6% |
| O | 263388 | 4.4% |
| Other values (6) | 563150 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5606813 | |
| Connector Punctuation | 396499 | 6.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1265351 | |
| I | 712626 | |
| R | 694167 | |
| C | 461050 | 8.2% |
| E | 438696 | 7.8% |
| N | 412385 | 7.4% |
| T | 399501 | 7.1% |
| M | 396499 | 7.1% |
| O | 263388 | 4.7% |
| H | 235097 | 4.2% |
| Other values (5) | 328053 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 396499 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5606813 | |
| Common | 396499 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1265351 | |
| I | 712626 | |
| R | 694167 | |
| C | 461050 | 8.2% |
| E | 438696 | 7.8% |
| N | 412385 | 7.4% |
| T | 399501 | 7.1% |
| M | 396499 | 7.1% |
| O | 263388 | 4.7% |
| H | 235097 | 4.2% |
| Other values (5) | 328053 | 5.9% |
Common
| Value | Count | Frequency (%) |
| _ | 396499 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6003312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1265351 | |
| I | 712626 | |
| R | 694167 | |
| C | 461050 | 7.7% |
| E | 438696 | 7.3% |
| N | 412385 | 6.9% |
| T | 399501 | 6.7% |
| _ | 396499 | 6.6% |
| M | 396499 | 6.6% |
| O | 263388 | 4.4% |
| Other values (6) | 563150 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 584592 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1169184 | |
| A | 1169184 | |
| N | 584592 | |
| O | 584592 | |
| T | 584592 | |
| H | 584592 | |
| _ | 584592 | |
| M | 584592 | |
| E | 584592 | |
| I | 584592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7015104 | |
| Connector Punctuation | 584592 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1169184 | |
| A | 1169184 | |
| N | 584592 | |
| O | 584592 | |
| T | 584592 | |
| H | 584592 | |
| M | 584592 | |
| E | 584592 | |
| I | 584592 | |
| C | 584592 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 584592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7015104 | |
| Common | 584592 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1169184 | |
| A | 1169184 | |
| N | 584592 | |
| O | 584592 | |
| T | 584592 | |
| H | 584592 | |
| M | 584592 | |
| E | 584592 | |
| I | 584592 | |
| C | 584592 |
Common
| Value | Count | Frequency (%) |
| _ | 584592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7599696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1169184 | |
| A | 1169184 | |
| N | 584592 | |
| O | 584592 | |
| T | 584592 | |
| H | 584592 | |
| _ | 584592 | |
| M | 584592 | |
| E | 584592 | |
| I | 584592 |
level0Gid
Text
Missing 
| Distinct | 105 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 562100 |
| Missing (%) | 96.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | MYS |
| 3rd row | COL |
| 4th row | COL |
| 5th row | IND |
| Value | Count | Frequency (%) |
| usa | 3144 | |
| eth | 2614 | 11.6% |
| col | 2574 | 11.4% |
| tza | 1839 | 8.2% |
| afg | 1717 | 7.6% |
| rus | 826 | 3.7% |
| per | 772 | 3.4% |
| guy | 696 | 3.1% |
| bra | 631 | 2.8% |
| ven | 607 | 2.7% |
| Other values (95) | 7072 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 9076 | |
| T | 5591 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4031 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3243 | 4.8% |
| G | 3139 | 4.7% |
| Other values (18) | 20458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 67474 | |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9076 | |
| T | 5591 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4031 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3243 | 4.8% |
| G | 3139 | 4.7% |
| Other values (16) | 20456 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67474 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 9076 | |
| T | 5591 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4031 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3243 | 4.8% |
| G | 3139 | 4.7% |
| Other values (16) | 20456 |
Common
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67476 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 9076 | |
| T | 5591 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4031 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3243 | 4.8% |
| G | 3139 | 4.7% |
| Other values (18) | 20458 |
level0Name
Text
Missing 
| Distinct | 105 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 562100 |
| Missing (%) | 96.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 8.545304997 |
| Min length | 4 |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Malaysia |
| 3rd row | Colombia |
| 4th row | Colombia |
| 5th row | India |
| Value | Count | Frequency (%) |
| united | 3159 | 11.8% |
| states | 3150 | 11.8% |
| ethiopia | 2614 | 9.8% |
| colombia | 2574 | 9.7% |
| tanzania | 1839 | 6.9% |
| afghanistan | 1717 | 6.4% |
| russia | 826 | 3.1% |
| peru | 772 | 2.9% |
| guyana | 696 | 2.6% |
| brazil | 631 | 2.4% |
| Other values (124) | 8681 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 30627 | |
| i | 21547 | 11.2% |
| n | 15591 | 8.1% |
| t | 15171 | 7.9% |
| e | 11832 | 6.2% |
| o | 9996 | 5.2% |
| s | 7991 | 4.2% |
| l | 5975 | 3.1% |
| h | 5409 | 2.8% |
| d | 5050 | 2.6% |
| Other values (41) | 63012 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161462 | |
| Uppercase Letter | 26571 | 13.8% |
| Space Separator | 4167 | 2.2% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30627 | |
| i | 21547 | |
| n | 15591 | |
| t | 15171 | |
| e | 11832 | 7.3% |
| o | 9996 | 6.2% |
| s | 7991 | 4.9% |
| l | 5975 | 3.7% |
| h | 5409 | 3.4% |
| d | 5050 | 3.1% |
| Other values (17) | 32273 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3783 | |
| U | 3541 | |
| C | 2941 | |
| E | 2916 | |
| T | 2351 | |
| A | 1931 | |
| P | 1892 | |
| G | 1242 | 4.7% |
| M | 1221 | 4.6% |
| I | 1009 | 3.8% |
| Other values (12) | 3744 |
Space Separator
| Value | Count | Frequency (%) |
| 4167 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 188033 | |
| Common | 4168 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30627 | |
| i | 21547 | 11.5% |
| n | 15591 | 8.3% |
| t | 15171 | 8.1% |
| e | 11832 | 6.3% |
| o | 9996 | 5.3% |
| s | 7991 | 4.2% |
| l | 5975 | 3.2% |
| h | 5409 | 2.9% |
| d | 5050 | 2.7% |
| Other values (39) | 58844 |
Common
| Value | Count | Frequency (%) |
| 4167 | ||
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 191824 | |
| None | 377 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 30627 | |
| i | 21547 | 11.2% |
| n | 15591 | 8.1% |
| t | 15171 | 7.9% |
| e | 11832 | 6.2% |
| o | 9996 | 5.2% |
| s | 7991 | 4.2% |
| l | 5975 | 3.1% |
| h | 5409 | 2.8% |
| d | 5050 | 2.6% |
| Other values (40) | 62635 |
None
| Value | Count | Frequency (%) |
| é | 377 |
level1Gid
Text
Missing 
| Distinct | 474 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 562129 |
| Missing (%) | 96.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.554333793 |
| Min length | 6 |
Unique
| Unique | 114 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | USA.49_1 |
|---|---|
| 2nd row | MYS.13_1 |
| 3rd row | COL.6_2 |
| 4th row | COL.4_2 |
| 5th row | IND.2_1 |
| Value | Count | Frequency (%) |
| eth.8_1 | 1052 | 4.7% |
| afg.28_1 | 995 | 4.4% |
| usa.2_1 | 907 | 4.0% |
| afg.15_1 | 663 | 3.0% |
| tza.14_1 | 573 | 2.6% |
| eth.4_1 | 547 | 2.4% |
| bra.14_1 | 486 | 2.2% |
| eth.6_1 | 475 | 2.1% |
| kwt.3_1 | 473 | 2.1% |
| per.8_1 | 473 | 2.1% |
| Other values (464) | 15819 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 28607 | |
| _ | 22463 | |
| . | 22443 | |
| 2 | 9571 | 5.6% |
| A | 9043 | 5.3% |
| T | 5565 | 3.3% |
| U | 5200 | 3.1% |
| S | 4996 | 2.9% |
| E | 4583 | 2.7% |
| 4 | 4144 | 2.4% |
| Other values (28) | 53078 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 67387 | |
| Decimal Number | 57400 | |
| Connector Punctuation | 22463 | 13.2% |
| Other Punctuation | 22443 | 13.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9043 | |
| T | 5565 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4029 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3242 | 4.8% |
| G | 3139 | 4.7% |
| Other values (16) | 20431 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 28607 | |
| 2 | 9571 | 16.7% |
| 4 | 4144 | 7.2% |
| 8 | 4002 | 7.0% |
| 3 | 3062 | 5.3% |
| 5 | 2456 | 4.3% |
| 0 | 1764 | 3.1% |
| 6 | 1648 | 2.9% |
| 9 | 1238 | 2.2% |
| 7 | 908 | 1.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 22463 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 22443 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 102306 | |
| Latin | 67387 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 9043 | |
| T | 5565 | 8.3% |
| U | 5200 | 7.7% |
| S | 4996 | 7.4% |
| E | 4583 | 6.8% |
| R | 4029 | 6.0% |
| H | 3631 | 5.4% |
| L | 3528 | 5.2% |
| C | 3242 | 4.8% |
| G | 3139 | 4.7% |
| Other values (16) | 20431 |
Common
| Value | Count | Frequency (%) |
| 1 | 28607 | |
| _ | 22463 | |
| . | 22443 | |
| 2 | 9571 | 9.4% |
| 4 | 4144 | 4.1% |
| 8 | 4002 | 3.9% |
| 3 | 3062 | 3.0% |
| 5 | 2456 | 2.4% |
| 0 | 1764 | 1.7% |
| 6 | 1648 | 1.6% |
| Other values (2) | 2146 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 169693 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 28607 | |
| _ | 22463 | |
| . | 22443 | |
| 2 | 9571 | 5.6% |
| A | 9043 | 5.3% |
| T | 5565 | 3.3% |
| U | 5200 | 3.1% |
| S | 4996 | 2.9% |
| E | 4583 | 2.7% |
| 4 | 4144 | 2.4% |
| Other values (28) | 53078 |
level1Name
Text
Missing 
| Distinct | 464 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 562129 |
| Missing (%) | 96.2% |
| Memory size | 4.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 25 |
| Mean length | 8.892356319 |
| Min length | 3 |
Unique
| Unique | 111 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | West Virginia |
|---|---|
| 2nd row | Sabah |
| 3rd row | Bolívar |
| 4th row | Atlántico |
| 5th row | Andhra Pradesh |
| Value | Count | Frequency (%) |
| oromia | 1052 | 3.7% |
| parwan | 995 | 3.5% |
| alaska | 907 | 3.2% |
| kandahar | 663 | 2.3% |
| morogoro | 573 | 2.0% |
| benshangul-gumaz | 547 | 1.9% |
| la | 528 | 1.8% |
| pará | 486 | 1.7% |
| gambela | 475 | 1.7% |
| peoples | 475 | 1.7% |
| Other values (530) | 22076 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 34856 | |
| r | 15095 | 7.6% |
| o | 12869 | 6.4% |
| n | 12568 | 6.3% |
| i | 10334 | 5.2% |
| e | 9945 | 5.0% |
| l | 7814 | 3.9% |
| s | 7349 | 3.7% |
| 6314 | 3.2% | |
| u | 6303 | 3.2% |
| Other values (71) | 76302 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 160545 | |
| Uppercase Letter | 29921 | 15.0% |
| Space Separator | 6314 | 3.2% |
| Dash Punctuation | 1871 | 0.9% |
| Other Punctuation | 1098 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 34856 | |
| r | 15095 | 9.4% |
| o | 12869 | 8.0% |
| n | 12568 | 7.8% |
| i | 10334 | 6.4% |
| e | 9945 | 6.2% |
| l | 7814 | 4.9% |
| s | 7349 | 4.6% |
| u | 6303 | 3.9% |
| h | 5478 | 3.4% |
| Other values (35) | 37934 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3472 | |
| P | 3269 | |
| A | 3137 | |
| M | 2361 | 7.9% |
| B | 1881 | 6.3% |
| K | 1834 | 6.1% |
| S | 1762 | 5.9% |
| T | 1704 | 5.7% |
| G | 1516 | 5.1% |
| N | 1511 | 5.0% |
| Other values (19) | 7474 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 582 | |
| ' | 365 | |
| / | 55 | 5.0% |
| ! | 51 | 4.6% |
| , | 45 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6314 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1871 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 190466 | |
| Common | 9283 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 34856 | |
| r | 15095 | 7.9% |
| o | 12869 | 6.8% |
| n | 12568 | 6.6% |
| i | 10334 | 5.4% |
| e | 9945 | 5.2% |
| l | 7814 | 4.1% |
| s | 7349 | 3.9% |
| u | 6303 | 3.3% |
| h | 5478 | 2.9% |
| Other values (64) | 67855 |
Common
| Value | Count | Frequency (%) |
| 6314 | ||
| - | 1871 | 20.2% |
| . | 582 | 6.3% |
| ' | 365 | 3.9% |
| / | 55 | 0.6% |
| ! | 51 | 0.5% |
| , | 45 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 197034 | |
| None | 2705 | 1.4% |
| Latin Ext Additional | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 34856 | |
| r | 15095 | 7.7% |
| o | 12869 | 6.5% |
| n | 12568 | 6.4% |
| i | 10334 | 5.2% |
| e | 9945 | 5.0% |
| l | 7814 | 4.0% |
| s | 7349 | 3.7% |
| 6314 | 3.2% | |
| u | 6303 | 3.2% |
| Other values (49) | 73587 |
None
| Value | Count | Frequency (%) |
| á | 1273 | |
| í | 439 | 16.2% |
| ó | 393 | 14.5% |
| é | 150 | 5.5% |
| ú | 102 | 3.8% |
| č | 87 | 3.2% |
| ð | 68 | 2.5% |
| ö | 46 | 1.7% |
| ț | 33 | 1.2% |
| ş | 28 | 1.0% |
| Other values (10) | 86 | 3.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ồ | 9 | |
| ậ | 1 | 10.0% |
level2Gid
Text
Missing 
| Distinct | 1023 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 562935 |
| Missing (%) | 96.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 9.903633929 |
| Min length | 9 |
Unique
| Unique | 301 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | USA.49.36_1 |
|---|---|
| 2nd row | MYS.13.14_1 |
| 3rd row | COL.6.38_2 |
| 4th row | COL.4.9_2 |
| 5th row | IND.2.10_1 |
| Value | Count | Frequency (%) |
| afg.28.1_1 | 995 | 4.6% |
| afg.15.3_1 | 663 | 3.1% |
| eth.4.2_1 | 547 | 2.5% |
| eth.8.3_1 | 515 | 2.4% |
| eth.6.1_1 | 475 | 2.2% |
| per.8.9_1 | 473 | 2.2% |
| tza.14.6_1 | 457 | 2.1% |
| bra.14.8_2 | 452 | 2.1% |
| eth.8.15_1 | 341 | 1.6% |
| tza.20.4_1 | 306 | 1.4% |
| Other values (1013) | 16433 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 43294 | |
| 1 | 34220 | |
| _ | 21657 | 10.1% |
| 2 | 15272 | 7.1% |
| A | 8908 | 4.2% |
| 4 | 6824 | 3.2% |
| 3 | 6631 | 3.1% |
| 8 | 5800 | 2.7% |
| U | 5194 | 2.4% |
| S | 4985 | 2.3% |
| Other values (28) | 61698 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 84563 | |
| Uppercase Letter | 64969 | |
| Other Punctuation | 43294 | |
| Connector Punctuation | 21657 | 10.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 8908 | |
| U | 5194 | 8.0% |
| S | 4985 | 7.7% |
| T | 4930 | 7.6% |
| E | 4582 | 7.1% |
| R | 3925 | 6.0% |
| H | 3631 | 5.6% |
| L | 3521 | 5.4% |
| C | 3238 | 5.0% |
| G | 3132 | 4.8% |
| Other values (16) | 18923 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 34220 | |
| 2 | 15272 | |
| 4 | 6824 | 8.1% |
| 3 | 6631 | 7.8% |
| 8 | 5800 | 6.9% |
| 5 | 4425 | 5.2% |
| 6 | 3541 | 4.2% |
| 0 | 2843 | 3.4% |
| 7 | 2655 | 3.1% |
| 9 | 2352 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 43294 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 21657 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 149514 | |
| Latin | 64969 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 8908 | |
| U | 5194 | 8.0% |
| S | 4985 | 7.7% |
| T | 4930 | 7.6% |
| E | 4582 | 7.1% |
| R | 3925 | 6.0% |
| H | 3631 | 5.6% |
| L | 3521 | 5.4% |
| C | 3238 | 5.0% |
| G | 3132 | 4.8% |
| Other values (16) | 18923 |
Common
| Value | Count | Frequency (%) |
| . | 43294 | |
| 1 | 34220 | |
| _ | 21657 | |
| 2 | 15272 | 10.2% |
| 4 | 6824 | 4.6% |
| 3 | 6631 | 4.4% |
| 8 | 5800 | 3.9% |
| 5 | 4425 | 3.0% |
| 6 | 3541 | 2.4% |
| 0 | 2843 | 1.9% |
| Other values (2) | 5007 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 214483 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 43294 | |
| 1 | 34220 | |
| _ | 21657 | 10.1% |
| 2 | 15272 | 7.1% |
| A | 8908 | 4.2% |
| 4 | 6824 | 3.2% |
| 3 | 6631 | 3.1% |
| 8 | 5800 | 2.7% |
| U | 5194 | 2.4% |
| S | 4985 | 2.3% |
| Other values (28) | 61698 |
level2Name
Text
Missing 
| Distinct | 991 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 563182 |
| Missing (%) | 96.3% |
| Memory size | 4.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 9.567211583 |
| Min length | 2 |
Unique
| Unique | 282 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | Pendleton |
|---|---|
| 2nd row | Penampang |
| 3rd row | Simití |
| 4th row | Manatí |
| 5th row | Visakhapatnam |
| Value | Count | Frequency (%) |
| bagram | 995 | 3.0% |
| la | 771 | 2.3% |
| rayon | 678 | 2.1% |
| daman | 663 | 2.0% |
| of | 606 | 1.8% |
| kemashi | 547 | 1.7% |
| san | 517 | 1.6% |
| borena | 515 | 1.6% |
| rest | 489 | 1.5% |
| convención | 484 | 1.5% |
| Other values (1170) | 26622 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 29287 | 14.3% |
| o | 14241 | 7.0% |
| n | 13826 | 6.7% |
| e | 13296 | 6.5% |
| i | 12566 | 6.1% |
| r | 11826 | 5.8% |
| 11477 | 5.6% | |
| t | 8023 | 3.9% |
| s | 6919 | 3.4% |
| l | 6642 | 3.2% |
| Other values (84) | 76731 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161183 | |
| Uppercase Letter | 30130 | 14.7% |
| Space Separator | 11477 | 5.6% |
| Decimal Number | 897 | 0.4% |
| Other Punctuation | 858 | 0.4% |
| Dash Punctuation | 279 | 0.1% |
| Open Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29287 | |
| o | 14241 | 8.8% |
| n | 13826 | 8.6% |
| e | 13296 | 8.2% |
| i | 12566 | 7.8% |
| r | 11826 | 7.3% |
| t | 8023 | 5.0% |
| s | 6919 | 4.3% |
| l | 6642 | 4.1% |
| g | 5977 | 3.7% |
| Other values (39) | 38580 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3445 | |
| A | 2880 | 9.6% |
| S | 2800 | 9.3% |
| M | 2700 | 9.0% |
| C | 2361 | 7.8% |
| K | 1985 | 6.6% |
| R | 1777 | 5.9% |
| T | 1728 | 5.7% |
| L | 1648 | 5.5% |
| P | 1369 | 4.5% |
| Other values (19) | 7437 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 297 | |
| 1 | 220 | |
| 7 | 175 | |
| 9 | 125 | |
| 8 | 38 | 4.2% |
| 0 | 28 | 3.1% |
| 4 | 9 | 1.0% |
| 2 | 3 | 0.3% |
| 5 | 2 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 635 | |
| ' | 119 | 13.9% |
| , | 56 | 6.5% |
| / | 48 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 11477 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 279 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 191313 | |
| Common | 13521 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 29287 | |
| o | 14241 | 7.4% |
| n | 13826 | 7.2% |
| e | 13296 | 6.9% |
| i | 12566 | 6.6% |
| r | 11826 | 6.2% |
| t | 8023 | 4.2% |
| s | 6919 | 3.6% |
| l | 6642 | 3.5% |
| g | 5977 | 3.1% |
| Other values (68) | 68710 |
Common
| Value | Count | Frequency (%) |
| 11477 | ||
| . | 635 | 4.7% |
| 3 | 297 | 2.2% |
| - | 279 | 2.1% |
| 1 | 220 | 1.6% |
| 7 | 175 | 1.3% |
| 9 | 125 | 0.9% |
| ' | 119 | 0.9% |
| , | 56 | 0.4% |
| / | 48 | 0.4% |
| Other values (6) | 90 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 202086 | |
| None | 2739 | 1.3% |
| Latin Ext Additional | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 29287 | 14.5% |
| o | 14241 | 7.0% |
| n | 13826 | 6.8% |
| e | 13296 | 6.6% |
| i | 12566 | 6.2% |
| r | 11826 | 5.9% |
| 11477 | 5.7% | |
| t | 8023 | 4.0% |
| s | 6919 | 3.4% |
| l | 6642 | 3.3% |
| Other values (58) | 73983 |
None
| Value | Count | Frequency (%) |
| í | 905 | |
| á | 755 | |
| ó | 654 | |
| é | 138 | 5.0% |
| ð | 57 | 2.1% |
| ú | 52 | 1.9% |
| ñ | 48 | 1.8% |
| â | 30 | 1.1% |
| É | 24 | 0.9% |
| æ | 18 | 0.7% |
| Other values (12) | 58 | 2.1% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 4 | |
| ả | 3 | |
| ứ | 1 | 11.1% |
| ọ | 1 | 11.1% |
level3Gid
Text
Missing 
| Distinct | 468 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 575359 |
| Missing (%) | 98.4% |
| Memory size | 4.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 11.89450883 |
| Min length | 11 |
Unique
| Unique | 169 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | IND.2.10.3_1 |
|---|---|
| 2nd row | RUS.34.42.1_1 |
| 3rd row | TZA.9.4.11_1 |
| 4th row | GRC.6.2.16_1 |
| 5th row | ETH.8.3.1_1 |
| Value | Count | Frequency (%) |
| eth.4.2.2_1 | 547 | 5.9% |
| eth.8.3.1_1 | 499 | 5.4% |
| eth.6.1.3_1 | 464 | 5.0% |
| tza.14.6.4_1 | 457 | 4.9% |
| per.8.9.7_1 | 329 | 3.6% |
| tza.20.4.4_1 | 306 | 3.3% |
| eth.2.3.6_1 | 289 | 3.1% |
| eth.8.15.11_1 | 277 | 3.0% |
| ind.31.22.2_1 | 228 | 2.5% |
| tza.9.4.11_1 | 203 | 2.2% |
| Other values (458) | 5634 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 27699 | |
| 1 | 19645 | |
| _ | 9233 | 8.4% |
| 2 | 5321 | 4.8% |
| T | 4830 | 4.4% |
| 4 | 4335 | 3.9% |
| 3 | 4012 | 3.7% |
| H | 3611 | 3.3% |
| E | 3546 | 3.2% |
| 8 | 2860 | 2.6% |
| Other values (24) | 24730 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45193 | |
| Other Punctuation | 27699 | |
| Uppercase Letter | 27697 | |
| Connector Punctuation | 9233 | 8.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 4830 | |
| H | 3611 | |
| E | 3546 | |
| A | 2746 | |
| R | 2293 | |
| Z | 2007 | |
| P | 1580 | 5.7% |
| N | 1224 | 4.4% |
| U | 888 | 3.2% |
| S | 849 | 3.1% |
| Other values (12) | 4123 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19645 | |
| 2 | 5321 | 11.8% |
| 4 | 4335 | 9.6% |
| 3 | 4012 | 8.9% |
| 8 | 2860 | 6.3% |
| 6 | 2581 | 5.7% |
| 0 | 1912 | 4.2% |
| 5 | 1768 | 3.9% |
| 9 | 1425 | 3.2% |
| 7 | 1334 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 27699 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 9233 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 82125 | |
| Latin | 27697 | 25.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 4830 | |
| H | 3611 | |
| E | 3546 | |
| A | 2746 | |
| R | 2293 | |
| Z | 2007 | |
| P | 1580 | 5.7% |
| N | 1224 | 4.4% |
| U | 888 | 3.2% |
| S | 849 | 3.1% |
| Other values (12) | 4123 |
Common
| Value | Count | Frequency (%) |
| . | 27699 | |
| 1 | 19645 | |
| _ | 9233 | 11.2% |
| 2 | 5321 | 6.5% |
| 4 | 4335 | 5.3% |
| 3 | 4012 | 4.9% |
| 8 | 2860 | 3.5% |
| 6 | 2581 | 3.1% |
| 0 | 1912 | 2.3% |
| 5 | 1768 | 2.2% |
| Other values (2) | 2759 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 109822 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 27699 | |
| 1 | 19645 | |
| _ | 9233 | 8.4% |
| 2 | 5321 | 4.8% |
| T | 4830 | 4.4% |
| 4 | 4335 | 3.9% |
| 3 | 4012 | 3.7% |
| H | 3611 | 3.3% |
| E | 3546 | 3.2% |
| 8 | 2860 | 2.6% |
| Other values (24) | 24730 |
level3Name
Text
Missing 
| Distinct | 441 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 576369 |
| Missing (%) | 98.6% |
| Memory size | 4.5 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 24 |
| Mean length | 8.994041104 |
| Min length | 3 |
Unique
| Unique | 158 ? |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | Chintapalle |
|---|---|
| 2nd row | Kwakoa |
| 3rd row | Paranesti |
| 4th row | Abaya |
| 5th row | Bio Jiganifado |
| Value | Count | Frequency (%) |
| bio | 547 | 4.8% |
| jiganifado | 547 | 4.8% |
| abaya | 499 | 4.4% |
| zuria | 483 | 4.3% |
| gambela | 464 | 4.1% |
| hembeti | 457 | 4.0% |
| quimbiri | 329 | 2.9% |
| kisarawe | 306 | 2.7% |
| gewane | 289 | 2.5% |
| lome | 277 | 2.4% |
| Other values (560) | 7143 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11829 | |
| i | 6915 | 9.3% |
| e | 5698 | 7.7% |
| o | 3716 | 5.0% |
| n | 3653 | 4.9% |
| 3118 | 4.2% | |
| r | 3027 | 4.1% |
| m | 2665 | 3.6% |
| u | 2447 | 3.3% |
| b | 2389 | 3.2% |
| Other values (75) | 28501 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 58838 | |
| Uppercase Letter | 10778 | 14.6% |
| Space Separator | 3118 | 4.2% |
| Decimal Number | 463 | 0.6% |
| Other Punctuation | 400 | 0.5% |
| Open Punctuation | 158 | 0.2% |
| Close Punctuation | 158 | 0.2% |
| Dash Punctuation | 45 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11829 | |
| i | 6915 | |
| e | 5698 | |
| o | 3716 | 6.3% |
| n | 3653 | 6.2% |
| r | 3027 | 5.1% |
| m | 2665 | 4.5% |
| u | 2447 | 4.2% |
| b | 2389 | 4.1% |
| t | 2369 | 4.0% |
| Other values (31) | 14130 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 998 | 9.3% |
| K | 921 | 8.5% |
| B | 904 | 8.4% |
| G | 846 | 7.8% |
| L | 711 | 6.6% |
| H | 671 | 6.2% |
| J | 566 | 5.3% |
| N | 547 | 5.1% |
| Z | 518 | 4.8% |
| C | 504 | 4.7% |
| Other values (16) | 3592 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 112 | |
| 3 | 98 | |
| 7 | 95 | |
| 1 | 62 | |
| 2 | 32 | 6.9% |
| 0 | 23 | 5.0% |
| 5 | 16 | 3.5% |
| 9 | 13 | 2.8% |
| 6 | 11 | 2.4% |
| 8 | 1 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 306 | |
| , | 80 | 20.0% |
| ' | 13 | 3.2% |
| / | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 3118 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 158 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 158 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69616 | |
| Common | 4342 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11829 | |
| i | 6915 | 9.9% |
| e | 5698 | 8.2% |
| o | 3716 | 5.3% |
| n | 3653 | 5.2% |
| r | 3027 | 4.3% |
| m | 2665 | 3.8% |
| u | 2447 | 3.5% |
| b | 2389 | 3.4% |
| t | 2369 | 3.4% |
| Other values (57) | 24908 |
Common
| Value | Count | Frequency (%) |
| 3118 | ||
| . | 306 | 7.0% |
| ( | 158 | 3.6% |
| ) | 158 | 3.6% |
| 4 | 112 | 2.6% |
| 3 | 98 | 2.3% |
| 7 | 95 | 2.2% |
| , | 80 | 1.8% |
| 1 | 62 | 1.4% |
| - | 45 | 1.0% |
| Other values (8) | 110 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73811 | |
| None | 135 | 0.2% |
| Latin Ext Additional | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11829 | |
| i | 6915 | 9.4% |
| e | 5698 | 7.7% |
| o | 3716 | 5.0% |
| n | 3653 | 4.9% |
| 3118 | 4.2% | |
| r | 3027 | 4.1% |
| m | 2665 | 3.6% |
| u | 2447 | 3.3% |
| b | 2389 | 3.2% |
| Other values (60) | 28354 |
None
| Value | Count | Frequency (%) |
| í | 46 | |
| â | 21 | |
| ó | 16 | 11.9% |
| è | 10 | 7.4% |
| ê | 9 | 6.7% |
| ñ | 9 | 6.7% |
| á | 9 | 6.7% |
| ơ | 7 | 5.2% |
| ü | 4 | 3.0% |
| ư | 4 | 3.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 3 | |
| ế | 3 | |
| ạ | 3 | |
| ờ | 2 | |
| ệ | 1 | 8.3% |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 273793 |
| Missing (%) | 46.8% |
| Memory size | 4.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | LC |
| 3rd row | LC |
| 4th row | LC |
| 5th row | LC |
| Value | Count | Frequency (%) |
| lc | 259391 | |
| ne | 22703 | 7.3% |
| nt | 14823 | 4.8% |
| vu | 8832 | 2.8% |
| en | 3006 | 1.0% |
| cr | 1367 | 0.4% |
| ex | 575 | 0.2% |
| dd | 71 | < 0.1% |
| ew | 31 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 260758 | |
| L | 259391 | |
| N | 40532 | 6.5% |
| E | 26315 | 4.2% |
| T | 14823 | 2.4% |
| V | 8832 | 1.4% |
| U | 8832 | 1.4% |
| R | 1367 | 0.2% |
| X | 575 | 0.1% |
| D | 142 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 621598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 260758 | |
| L | 259391 | |
| N | 40532 | 6.5% |
| E | 26315 | 4.2% |
| T | 14823 | 2.4% |
| V | 8832 | 1.4% |
| U | 8832 | 1.4% |
| R | 1367 | 0.2% |
| X | 575 | 0.1% |
| D | 142 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 621598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 260758 | |
| L | 259391 | |
| N | 40532 | 6.5% |
| E | 26315 | 4.2% |
| T | 14823 | 2.4% |
| V | 8832 | 1.4% |
| U | 8832 | 1.4% |
| R | 1367 | 0.2% |
| X | 575 | 0.1% |
| D | 142 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 621598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 260758 | |
| L | 259391 | |
| N | 40532 | 6.5% |
| E | 26315 | 4.2% |
| T | 14823 | 2.4% |
| V | 8832 | 1.4% |
| U | 8832 | 1.4% |
| R | 1367 | 0.2% |
| X | 575 | 0.1% |
| D | 142 | < 0.1% |